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TITLE OF THE INVENTION 

NUCLEIC ACID SEQUENCES ENCODING NOVEL POINT MUTATIONS ON mGliiR2 
AND mGluR3, POLYPEPTIDES WITH SAID MUTATIONS, AND METHODS OF USING 
SAID NUCLEIC ACID SEQUENCES, AND SAID POLYPEPTIDES, TO IDENTIFY, 
5 PREDICT AND EVALUATE SPECIFIC, SELECTIVE MODULATORS WHOSE 
ASSOCIATION TO mGlu2 OR mGlrRS IS EFFECTED BY SAID MUTATIONS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims tilie benefit of U.S. Provisional Application No. 
10 60/60/409,910, filed September 9, 2002, the contents of which are incorporated herein by 
reference in their entirety. 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
Non applicable. 

15 

REFERENCE TO MICROFICHE APPENDIX 
Not applicable. 

BACKGROUND OF THE INVENTION 

20 In the mammalian central nervous system (CNS), the transmission of nerve 

impulses is controlled by the interaction between a neurotransmitter, released by a sending 
neuron, and a surface receptor on a receiving neuron. This interaction causes excitation of the 
receiving neuron. L-Glutamate, which is the most abundant neurotransmitter in the CNS, 
mediates the major excitatory patihway in mammals, and is referred to as an excitatory amino 

25 acid (EAA). The receptors that respond to glutamate are called excitatory amino acid receptors 
(EAA receptors). See Watkins & Evans, Annual Reviews in Pharmacology and Toxicology, 
21:165 (1981); Monaghan, Bridges, and Cotman, Annual Reviews in Pharmacology and 
Toxicology, 29:365 (1989); Watkins, Krogsgaard-Larsen, and Honore, Transactions in 
Pharmaceutical Science, 11:25 (1990). The excitatory amino acids are of great physiological 

30 importance, playing a role in a variety of physiological processes, such as long-term potentiation 
(learning and memory), the development of synaptic plasticity, motor control, respiration, 
cardiovascular regulation, and sensory perception. 

Excitatory amino acid receptors are classified into two general families - 
ionotropic and metabotropic. lonotropic receptors are directly coupled to the opening of cation 

35 chaimels in the cell membrane of the neurons. This type of receptor has been subdivided into at 
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least four subtypes, which are defined by the depolarizing actions of the selective agonists y- 
aminobutyric acid (GABA), N-methyl-D-aspartate (NMD A), a-aniino-3-hydroxy-5- 
inethyUsoxazole-4-propionic acid (AMP A), and nicotinic acetylcholine (ACh). Kandel et al.. 
Principles of Neural Science, 3d ed, Elsevier Science Press, 1991. 
5 Metabotropic, the second genial EAA receptor family, includes the G-protein 

(guanosine nucleotide-binding protein) or second messenger-linked "metabotropic" excitatory 
amino acid receptor (also known as a "G Protein-Coupled Receptof , or a "GPCR"). This 
second type is coupled to multiple second messenger systems that lead to enhanced 
phosphoinositide hydrolysis, activation of phospholipase D, increases or decreases in cAMP 

10 formation, or changes in ion channel function. Schoepp and Conn, Trends in Pharmacological 
Science, 14:13 (1993). Both types of receptors appear not only to mediate normal synaptic 
transmission along excitatory pathways, but also participate in the modification of synaptic 
coimections during development and throughout life. Schoepp, Bockaert, and Sladeczek, Trends 
in Pharmacological Science, 11:508 (1990); McDonald and Johnson, Brain Research Reviews, 

15 15:41 (1990). 

The excessive or inappropriate stimulation of excitatory amino acid receptors 
leads to neuronal cell damage or loss by way of a mechanism known as excitotoxicity. This 
process has been suggested to mediate neuronal degeneration in a variety of conditions. The 
medical consequences of such neuronal degeneration makes the abatement of these degenerative 

20 neurological processes an important therapeutic goal. In that one effect of a positive mGluR2 
modulator (e.g., an mGluR2 potentiator) is to decrease activity at a synapse near mGluR2 
receptors (by stimulating mGluR2 activity), the development of specific positive modulators for 
mGluR2 appears to be an appropriate approach to control neuronal pathways and, in doing so, 
reducing neural cell damage. A key to the specificity, and expected greater effectiveness, of 

25 such modulators, whether for mGluR2, mGluR3, or other GPCRs, is to be able to evaluate and 
identify modulators that, at a pharmacologically acceptable and effective dose, have a positive 
effect on one such GPCR, while not having the same positive effect on other GPCRs which have 
opposite or undesired effects on the cell. 

In this regard, it is noted that the metabotropic glutamate receptors ("mGluRx," 

30 where x is an integer) are a highly heterogeneous family of glutamate receptors. All are linked to 
multiple second-messenger pathways. Presently at least eight identified subtypes of such 
receptors that fall into three classes based on second-messenger association, sequence homology, 
and agonist selectivity. See BOND, A. et al., '"Neuroprotective Effects of LY3 79268, a Selective 
mGlu2/3 Receptor Agonist: Investigations into Possible Mechanism of Action In Vivo", JPET, 

35 2000, 800-809: 294, USA. Also see "G Protein-Coupled Receptor Allosterism and 
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Complexing" Pharmacological Reviews, 54:2, 323-374. These receptors fimction to modulate 
the presynaptic release of glutamate, and the postsynaptic sensitivity of the neuronal cell to 
glutamate excitation. It is generally recognized that agonists and antagonists of these receptors 
may he useful for the treatment of acute and chronic neurodegenerative conditions, and as 
5 antipsychotic, anticonvulsant, analgesic, anxiolytic, antidepressant, and anti-emetic agents. The 
present invention discloses at least one allosteric binding site which would be useful for the 
identification and development of mGluR2- and mGluR3-specific modulators that associate with 
such site. 

In recent years, the role of allosteric modulators for the metabotropic glutamate 
10 receptors has been a subject of increasing interest and research. An advantage of developing a 
specific allosteric modulator for one type of metabotropic glutamate receptor is that the use of 
such modulator may provide a desirable pharmacological and behavioral result at lower doses, 
with less toxicity and side effects due to its specificity. 

The present invention advances the art by identifying a site of allosteric 
15 modulation in mGluR2 and mGluR3, and by characterizing critical components of said site, 
through point mutations, where such mutations have a dramatic effect on potentiation of the 
glutamate cell receptor. 

SUMMARY OF THE INVENTION 

20 The present invention provides novel forms of mGluR2 and mGluR3 comprising 

at least one allosteric binding site which has been modified by one, two, or three single amino 
acid point mutations. The loss of potentiation of receptors bearing such mutations in mGluR2 is 
clear and dramatic. The present invention provides nucleic acid sequences, expressible in 
isolated cells, useful for the identification of novel modulators and for the rational development 

25 of specific modulators of metabotropic glutamate recq)tors. The present invration also provides 
for polypeptides encoded by the herein disclosed isolated nucleic acid molecules that will find 
use in the identification of novel modulators and for the rational development of specific 
modulators of metabotropic glutamate receptors. The identification of the herein described 
nucleic acid and their respective mutant gene products will aid in the characterization and 

30 treatment of physiological disorders is hereby fiirthered. 

In particular, the present invention advances the art by identifying at lease one site 
of allosteric modulation in mGluR2 and mGluR3, by characterizing the said site through such 
mutations. These mutations result in a dramatic effect on potentiation by modulators at said site. 
In particular, certain modulators that have a dramatic effect on wild type of mGluR2 are shown 
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to have far less effect on variant mGluR2 receptors in which one or more such point mutations 
are introduced. Critically, such effect is not observed by the other specific subtype, mGluR3. 

The present invention also provides for assays to identify and determine the 
efficacy and reaction profile of modulators acting at such newly identified allosteric binding 
5 sites, which proves usefiil in the treatment or prevention of disorders associated with an excess or 
deficiency in the amount of glutamate present. Modulators developed with a high specificity to a 
single type of metabotropic G-Protein Receptor is expected to have greater utility and efficicacy 
because: 1) as a modulator, it can function at lower concentrations than an agonist or antagonist; 
and 2) with great specificity, a modulator can precisely target the desired metabotropic G-Protem 
10 Receptor without affecting the activity of related receptors having different, potentially undesired 
cellular effects. This is perceived to provide for development of modulators whose use in a 
patient in need thereof results in less overall toxicity to said patient 

BRIEF DESCRIPTION OF THE DRAWINGS 
15 Figure 1 provides a summary chart of the normalized response to glutamate in 

HEK-293 cells transfected with wild-type mGluR2 receptors, with some treatments exposed to 
two mGluR2 modulators, A and B. 

Figure 2 provides a summary chart of the normalized response to glutamate in 
HEK-293 cells transfected with a mutant mGluR2 receptor of the present invention, with some 
20 treatments exposed to two mGluR2 modulators, A and B. 

Figure 3 provides a summary chart of the normalized response to glutamate in 
HEK-293 cells transfected with another mutant mGluR2 receptor of the present invention, with 
some treatments exposed to two mGluR2 modulators, A and B, 

Figure 4 provides a summary chart of the normalized response to glutamate in 
25 HEK-293 cells transfected with another mutant mGluR2 receptor of the present invention, with 
some treatments exposed to two mGluR2 modulators, A and B. 

Figure 5 provides a summary chart of the normalized response to glutamate in 
HEK-293 cells transfected with another mutant mGluR2 receptor of the present invention, with 
some treatments exposed to two mGluR2 modulators, A and B. 
30 Figure 6 provides a summary chart of the normalized response to glutamate in 

HEK-293 cells transfected with another mutant mGluR2 receptor of the present invention, with 
some treatments exposed to two mGluR2 modulators, A and B. 
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DETAILED DESCRIPTION OF THE INVENTION 
Definitions and Terms of Art 

The terms and abbreviations used in this document have their normal meanings 
unless otherwise designated. For example "°C" refers to degrees Celsius; "N" refers to noraial or 
5 normality; "mmol" refers to millimole or millimoles; "g" refers to gram or grams; "ml" means 
milliliter or milliliters; "M" refers to molar or molarity; "jig" and "ug" refer to microgram or 
micrograms; "pM" and **uM" refer to microMole or micromoles; and "jal" and •'ur refer to 
microliter or microliters. It is noted that some definitions are located outside this section. 

All nucleic acid sequences, unless otherwise designated, are written in the 
10 direction firom tiie 5' end to the 3' end, firequently referred to as "5' to 3"*. 

All amino acid or protein sequences, unless otherwise designated, are written 
commencing with the amino terminus ("N-terminus") and concluding with the carboxy terminus 
("C-terminus"). 

"Base pan:" or "bp" as used herein refers to DNA or RNA. The abbreviations 

15 A,C,G, and T correspond to the 5'-monophosphate forms of the deoxyribonucleosides 

(deoxy)adenosine, (deoxy)c3^idine, (deoxy)guanosine, and (deoxy)thymidine, respectively, when 
they occur in DNA molecules. The abbreviations U,C,G, and A correspond to the 5 - 
monophosphate forms of the ribonucleosides urodine, cytidme, guanosine, and adenosine, 
respectively when they occur in RNA molecules. In double stranded DNA, base pair may refer 

20 to a pauing of A with T or C with G. In a DNA/RNA, heteroduplex base pair may refer to a 
pairing of A with U or C with G. (See the definition of "complementary", infra.) 

The terms "cleavage" or "restriction digestion" of DNA refers to the catalytic 
cleavage of the DNA with a restriction enzyme that acts only at certain sequences in the DNA 
("sequence-specific endonucleases"). The various restriction enzymes used herein are 

25 commercially available and their reaction conditions, cofactors, and otiier requirements were 
used as wovdd be known to one of ordinary skill in the art. Appropriate buffers and substrate 
amounts for particular restriction eraymes are specified by the manufacturer or can be readily 
found in the literature. 

"Ligation" refers to the process of forming phosphodiester bonds between two 

30 nucleic acid fragments (T. Mardatis, et al., supra., p. 146). Unless otherwise provided, ligation 
may be accomplished using known buffers and conditions with a DNA ligase, such as T4 DNA 
ligase. 

The term "plasmid" refers to an extrachromosomal (usually) self-replicating 
genetic element. Plasmids are generally designated by a lower case "p" followed by letters 
35 and/or numbers. The starting plasmids herein are either commercially available, publicly 



-5- 



wo 2004/024936 



PCT/US2003/028470 



available on aa imrestricted basis, or can be constructed from available plasmids in accordance 
with published procedures. In addition, equivalent plasmids to those described are known in the 
art and will be apparent to the ordinarily skilled artisan. 

The term "reading frame" means tiie nucleotide sequence from which translation 
5 occurs "read" in triplets by the translational apparatus of transfer RNA (tRNA) and ribosomes 
and associated factors, each triplet corresponding to a particular amino acid. A fiameshift 
mutation occurs when a base pair is inserted or deleted from a DNA segment. When this occurs, 
the result is a different protein from that coded for by tiie DNA segment prior to the frameshift 
mutation. To insure against this, the triplet codons corresponding to the desired polypeptide 
10 must be aligned in multiples of three from the initiation codon, i.e. the correct "reading frame" 
being maintained. 

"Recombinant DNA cloning vector" as used herein refers to any autonomously 
replicating agent, including, but not limited to, plasmids and phages, comprising a DNA 
molecule to which one or more additional DNA segments can or have been added. 

15 The term "recombinant DNA expression vector" as used herein refers to any 

recombinant DNA cloning vector in which a promoter and other regulatory elements to control 
transcription of the inserted DNA. 

The term "expression vector system" as used herein refers to a recombinant DNA 
expression vector in combination with one or more trans-acting factors that specifically influence 

20 transcription, stability, or replication of the recombinant DNA expression vector. The trans- 
acting factor may be expressed from a co-transfected plasmid, virus, or other extrachromosomal 
element, or may be expressed from a gene integrated within the chromosome. 

"Transcription" as used herein refers to the process whereby information 
contained in a nucleotide sequence of DNA is transferred to a complementary RNA sequence. 

25 The term "transfection" as used herein refers to the taking up of an expression 

vector by a host cell whether or not any coding sequences are in fact expressed. Numerous 
methods of transfection are known to the ordinarily skilled artisan, for example, lipofectamine, 
calcium phosphate co-precipitation, and electroporation. Successfiil transfection is generally 
recognized when any indication of the operation of this vector occurs within the host cell. 

30 The term "transformation" as used herein means the introduction of DNA into an 

organism so that the DNA is replicable, either as an extrachromosomal element or by 
chromosomal integration. Methods of transforming bacterial and eukaryotic hosts are well 
known in the art, many of which methods are summarized in J. Sambrook, et al., "Molecular 
Cloning: A Laboratory Manual" (1989), 
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The term "translation" as used herein refers to the process whereby the genetic 
information of messenger RNA is used to specify and direct the synthesis of a polypeptide chain. 

The term "vector" as used herein refers to a nucleic acid compound used for the 
transformation of cells with polynucleotide sequences corresponding to appropriate protein 
5 molecules which when combined with appropriate control sequences confer specific properties 
on the host cell to be transformed. Plasmids, viruses, and bacteriophage are suitable vectors. 
Artificial vectors are constructed by joining DNA molecules from different sources. The term 
"vector" as used herein includes Recombinant DNA cloning vectors and Recombinant DNA 
expression vectors. 

10 The terms "complementary" or "complementarity" as used herein refers to the 

pairing of bases, purines and pyrimidines, that associate through hydrogen bonding in double 
stranded nucleic acid. The following base pairs are complementary: guanine and cytosine; 
adenine and thymine; and adenine and uracil. 

"Isolated amino acid sequence" refers to any amino acid sequence, however 
15 constructed or synthesized, which is locationally distinct from the naturally occurring sequence. 

"Isolated DNA compound" refers to any DNA sequence, however constmcted or 
synthesized, which is locationally distinct from its natural location in genomic DNA. 

"Isolated nucleic acid compoxmd" refers to any RNA or DNA sequence, however 
constructed or synthesized, which is locationally distinct from its natural location. 
20 A "primer" is a nucleic acid fragment which functions as an initiating substrate 

for enzymatic or synthetic elongation. 

The term "promoter" refers to a DNA sequence which directs transcription of 

DNA to RNA. 

As used herein, a nucleic acid "probe" is single-stranded DNA or RNA, or analog 
25 thereof, that has a sequence of nucleotides that includes at least 14, preferably at least 20, more 
preferably at least SO, contiguous bases that are the same as or the complement of any 14 or more 
contiguous bases set forth in SEQ.ID.NO.:l. In addition, the entire cDNA-encoding region of 
the entire sequence corresponding to SEQ.ID.NO.:! may be used as a probe. 

Presently preferred probe-based screening conditions comprise a temperature of 
30 about 37°C, a formamide concentration of about 20%, and a salt concentration of about 5X 
standard saline citrate (SSC; 20X SSC contains 3M sodium chloride, 0.3M sodimn citrate, pH 
7.0). Such conditions will allow the identification of sequences which have a substantial degree 
of similarity with the probe sequence, without requiring perfect homology. 

"Hybridization" refers to the binding of complementary strands of nucleic acid 
35 (i.e., sense:antisense strands or probe: target-DNA) to each other through hydrogen bonds, similar 
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10 



to the bonds that naturally occur in chromosomal DNA. Stringency levels used to hybridize a 
given probe with target-DNA can be readily varied by those of skill in the art. 

The phrase "stringent hybridization conditioned" is used herein to refer to 
conditions imder which polynucleic acid hybrids are stable. As known to those of skill in the art, 
the stability of hybrids is reflected in the melting temperature (Tm) of the hybrids. Tm can be 

approximated by the formula: 



81.5^C..16.6(logio[Na+])+0.41(%G+C)-600/l, 



where 1 is the length of the hybrids in nucleotides. Tm decreases approximately 1°-1.5**C with 
every 1% decrease in sequence homology. In general, the stability of a hybrid is a function of 
sodium ion concentration and temperature. Typically, the hybridization reaction is performed 
under conditions of lower stringency, followed by washes of varying, but higher, stringency. 

15 Reference to hybridization stringency relates to such washing conditions. 

As used herein, the phrase "moderately stringent hybridization" refers to 
conditions that permit target-DNA to bind a complementary nucleic acid that has about 60% 
identity, preferably about 75% identity, more preferably about 85% identity to the target DNA; 
with greater than about 90% identity to target-DNA being especially preferred. Preferably, 

20 moderately stringent conditions are conditions equivalent to hybridization in 50% formamide, 
5X Denharf s solution, 5X SSPE, 0.2% SDS at 42^C, followed by washmg in 0.2X SSPE, 0.2% 
SDS,at65°C. 

The phrase "high stringency hybridization" refers to conditions that permit 
hybridization of only those nucleic acid sequences that form stable hybrids in 0.01 8M NaCl at 
25 65^*0 (i.e., if a hybrid is not stable in 0.01 8M NaCl at 65°C, it will not be stable under high 

stringency conditions, as contemplated herein). High stringency conditions can be provided, for 
example, by hybridization in 50% formamide, 5X Denhart's solution, 5X SSPE, 0.2% SDS at 
42*^C., followed by washing in O.IX SSPE, and 0.1% SDS at 65^C. 

The phrase "low stringency hybridization" refers to conditions equivalent to 
30 hybridization in 10% formamide, 5X Denhart's solution, 6X SSPE, 0.2% SDS at 42°C., followed 
by washing in IX SSPE, 0.2% SDS, at 50°C. 

The term "antigenically distinct" as used herein refers to a situation in which 
antibodies raised against an epitope of the proteins of the present invention, or a fragment 
thereof, may be used to differentiate between the proteins of the present invention and other 
35 glutamate receptor subtypes. This term may also be employed in the sense that such antibodies 
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may be used to differentiate between the mutant human mGluR2 and mGluR3 receptor proteins 
and analogous proteins derived from other species. 

The term "PGR" as used herein refers to the widely known polymerase chain 
reaction employing a thermally-stable polymerase. 
5 Further, as used herein, "modulator" may be any molecule, compound, or any 

other composition which is suspected of being capable of modulating the rate or other 
substantive characteristic of binding of glutamate to an mGluRx receptor in vivo or m vitro by 
acting at a site (an allosteric site) on the receptor that is not the agonist binding site. 
•^Modulators" that are screened in the present invention can be any substances that are generally 

10 screened in the pharmaceutical industry during the drug development process. The substances 
may be macromolecules, such as biological polymers, including proteins, polysaccharides, 
nucleic acids, or the like. More usually, a substance will be a small molecule having a molecular 
weight below about 2 kD, more usually below 1.5 kD, frequently below 1 kD, and usually in the 
range from 100 to 1,000 D, and even more usually in the range from 200 D to 750 D. One or 

15 more substances may be pre-selected based on a variety of criteria. For example, suitable 
substances may be selected based upon SAR analysis based upon the calculated or predicted 
three-dimensional structures of the allosteric binding site discovered hereiu. Alternatively, the 
substances may be selected randomly and tested by the screening methods of the present 
invention. Substances which are able to up-modulate or down-modulate glutamate binding to an 

20 mGluRx receptor in vitro are considered as candidates for further screening of their ability to 
affect the activity of the tested mGluRx ia cells and/or animals. Substances are often tested in 
the methods of the present invention as large collections of substances, e.g. libraries of low 
molecular weight organic compounds, peptides, or natural products. 

In the scientific literature, the temi "allosteric" has taken somewhat different 

25 meanings with different scopes (see "G Protein-Coupled Receptor Allosterism and Complexing" 
Pharmacological Reviews, 54:2, 323-374, at 326-327). As used herein, "allosteric" is taken to 
mean a site other than flie ligand binding site where a non-ligand molecule (one that is neither an 
agonist nor an antagonist at the reactive binding site of the molecule) may bind, and wherein 
such binding affects the binding or the ligand at the ligand binding site (as expressed by rate, 

30 association constant, affinity, etc.). In contrast to some definitions in the field, as used herein the 
binding of a moleciile at the ligand binding site need not affect the binding of a molecule at the 
allosteric site. An "allosteric modulator" is a modulator, as defined above, that binds to an 
allosteric site on a molecule of interest. It is noted that the same molecule may act as an 
allosteric modulator on one protein or receptor, and act as an agonist or antagonist with regard to 

35 a second protein or receptor. 
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With regard to the effect of a mutation of an mGluRx polypeptide as described 
herein, the tem:is "depotentiate(s)" and "depotentiation" are taken to mean that tihie effect of such 
mutation results in a measurable decrease in the effect of a modulator in combination with said 
mutant polypeptide in comparison to a non-mutated wild-type mGluRx of the same species and 
5 sub^e. Preferably, the measurable decrease is substantial as that term is defined herein. 

Likewise, the terms "potentiate(s)" and "potentiation" are taken to mean that the effect of such 
mutation results in a measurable increase in the effect of a modulator in combination with said 
mutant polypeptide in comparison to a non-mutated wild-type mGluRx of the same species and 
subtype. Preferably, the measurable increase is substantial as that term is defined herein. 

10 A "structure activity relationship" (SAR) refers to the relationship between a 

given chemical structure or series of chemical structures and the pharmacological activity that 
series of compounds has on the given target or action of the compound. Compoxmds can be 
classed together based on a number of characteristics including but not limited to such structural 
characteristics as shape, size, stereochemical arrangement, and distribution of functional groups. 

15 Other factors contributing to structure- activity relationships can include chemical reactivity, 
electronic effects, resonance, and inductive effects. The utilization of various assays that can 
differentiate between structural modifications that produce increased affinity or specificity of a 
iven compoimd on its target compared to the activity seen on closely related targets or on targets 
that may be associated with off-target or mwanted activity of the compounds. The refinement of 

20 this SAR process can lead to pharmacologically active compoxmds that are more specific and less 
toxic than the initially identified compoimd or series of compounds. 

A "conservative amino acid substitution*' refers to the replacement of one amino 
acid residue by another, chemically similar, amino acid residue. Examples of such conservative 
substitutions are: substitution of one hydrophobic residue (isoleucine, leucine, valine, or 

25 metiiionine) for another; substitution of one polar residue for another polar residue of the same 
charge (e.g., arginine for lysine; glutamic acid for aspartic acid); substitution of one aromatic 
amino acid (tryptophan, tyrosine, or phenylalanine) for another. 

A "conservative anaino acid substitution" as defined above is but one type of 
variation of an amino acid sequence listing encompassed by the broader term, "conservatively 

30 modified variants thereof." For instance, the latter is taken to have the meaning ascribed to the 
term in M.P.E.P. § 2422.03, Eighth Edition, 2001, which can include, without being limited to 
this example, deletions such as "at the C-terminus by 1, 2, 3, 4, or 5 residues." Where 
appropriate within this specification, conservative amino acid substitutions that are known or 
reasonably predicted to not adversely alter the desired fimctionality of the novel sequences 

35 disclosed herein are disclosed. Such disclosed conservative amino acid substitutions are 
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considered to fall within the scope of the sequence listings that include the novel polypeptide 
sequences disclosed and claimed herein. The same principal of "conservatively modified 
variants" applies to nucleotide sequences as well, additionally taking into account the 
redundancy of codons for a particular amino acid, and the optimization of codons for expression 
S in particular species. 

For instance, but not meant to be limiting, an amino acid sequence or a nucleotide 
sequence is considered "identical" to a reference sequence if the two sequences are the same 
when aligned for maximum correspondence over a comparison window. Optimal alignment of 
nucleotide and amino acid sequences for aligning comparison window may be conducted by the 

10 local homology algorithm of Smith &Waterman, 1981, Adv. Appl. Math. 2:482, by the 

homology alignment algorithm of Needleman & Wunsch, 1970, J. Mol. Biol. 48:443, by the 
search for similarity method of Pearson & Lipman, 1988, Proc. Natl. Acad. Sci., U.S.A. 
85:2444-2448, by computerized implementations of these algorithms (GAP, BESFIT, FASTA, 
and TFASTA in the Wisconsin Genetics Software Package Release 7.0, Genetics Computer 

15 Group, 575 Science Dr., Madison, WI), or by inspection. Such determination of identity can be 
considered to indicate a "conservatively modified variant" of a particular amino acid sequence or 
nucleotide sequence so long as the variant continues to function as a polypeptide or nucleic acid 
sequence according to the present invention. 

"Consists essentially," with respect to a mutant polypeptide, indicates that the 

20 reference sequence can be modified by N-terminal and/or C-terminal additions or deletions that 
do not cause a substantial decrease in the ability of the mutant polypeptide to fimction to affect 
die binding of an allosteric modulator at the site of the mutation compared to the reference 
sequence lacking such additions or deletions. An example of a deletion is the removal of an N- 
terminal methionine. 

25 A "substantial change" in die ability of die mutant mGluR2 and mGluRS proteins 

of the present invention to affect the modulation relative to a wild-type control, is defined to be a 
change of at least about 20%, more usually at least about 50%, preferably at least about 75%, and 
often at least about 90% or higher compared to the response of said wild-type control. This 
change may be a decrease or an increase relative to the wild-type control, e.g., a "substantial 

30 decrease" or a "substantial increase" per the definition of "substantial change" relative to such 
control. 

The term "antibody" refers to a polypeptide substantially encoded by an 
immunoglobulin gene or immunoglobulin genes, or fragments thereof, which specifically bind 
and recognize an analyte (antigen). The recognized immimoglobulin genes include the kappa, 
35 lambda, alpha, gamma, delta, epsilon and mu constant region genes, as well as the myriad 
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immunoglobulin variable region genes. Antibodies exist, e.g., as intact immunoglobulins or as a 
number of well-characterized fragments produced by digestion with various peptidases. These 
include, e.g.. Fab' and F(aby2 fragments. The term "antibody" also includes antibody fragments 
either produced by the modification of whole antibodies or those synthesized de novo using 
5 recombinant DNA methodologies, and fturther includes "humanized" antibodies made by 
conventional techniques. 

The term "inrntmnoassay" is an assay that utilizes an antibody to specifically bind 
an analyte. The immunoassay is characterized by the use of specific binding properties of a 
particular antibody to isolate, target, and/or quantify the analyte. 

10 An antibody "specifically binds to" or "is specifically immunoreactive with" a 

protein, polypeptide, or peptide when the antibody functions in a binding reaction which is 
determinative of the presence of the protein, polypeptide, or peptide in the presence of a 
heterogeneous population of proteins and other biologies. Thus, under designated immunoassay 
conditions, the specified antibodies bind preferentially to a particular protein, polypeptide, or 

15 peptide and do not bind in a significant amount to other proteins, polypeptides, or peptides 
present in the sample. Specific binding to a protein, polypeptide, or peptide imder such 
conditions requires an antibody that is selected for specificity for a particular protein, 
polypeptide, or peptide. As used herein, the term "recognize" as it regards an antibody's 
association to an a particular protein, polypeptide, or peptide, or an epitope therein, is taken to 

20 mean that said antibody "specifically binds to" or "is specifically immunoreactive with that 
protein, polypeptide, or peptide." 

A variety of immunoassay formats may be used to select antibodies specifically 
immunoreactive with a particular protein, polypeptide, or peptide. For example, solid-phase 
ELISA immunoassays are routinely xised to select monoclonal antibodies specifically 

25 immunoreactive with a protem, polypeptide, or peptide. See Harlow & Lane, 1988, Antibodies, 
A Laboratory Manual, Cold Spring Harbor Publications, New York, for a description of 
immunoassay formats and conditions that can be used to determine specific immunoreactivity. 

"Transfection" refers to any of the methods known in the art for introducing DNA 
mto a cell, for example, but not Umited to, the methods of lipofectamine, calcixmi phosphate or 

30 calcium chloride mediated transfection, electroporation, and infection with a retroviral vector. 

Experiments Leading to Invention 

The identification of the site of action of the mGIuR2 potentiators was initiated 
based on the fact that compounds had been identified that positively modulated mGluR2 in the 
35 presence of ECiQ of glutamate but had no potentiator activity on mGluR3 (patent references, 
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Lilly and Merck). The subtype specificity and selectivity of these compounds implied that a 
subset of the amino acid residues that differed between mGluR2 receptor and mGluR3 were most 
likely involved in the specific binding of the compound. Initial alignments of the amino acid 
sequences of human mGluR2 (hmGluR2) and human mGluR3 (hmGluR3) revealed an overall 
5 amino acid homology of 66.3%, In order to identify the site of action of the mGluR2 positive 
modulators, various regions that differed between the hmGluR2 and hmGluR3 receptors were 
identified and selected for chimeric substitution between sequences contamed in hmGluR2 and 
hmGluRS, 

The initial region selected for chimeric substitution was a region in the amino 

10 terminal domain of hmGluR2 that was 5' of transmembrane one (Arginine-425 to Glutamic 
Acid-569). This domain contained a very cysteine rich region. It had recently been rq)orted 
(Schweitzer et al. (2000)) that zinc selectively inhibited the agonist activity of glutamate on 
mGluR2 compared to mGluR3. This apparent subtype selectivity of the effect of zmc on 
mGluR2 compared to mGluR3 could be due to the presence of a zinc finger domain on mGluR2 

15 that was not present on mGluR3. Schweitzer et al. (2000) had not identified the site of action of 
this zinc effect. We targeted the substitution of the cysteine rich region of mGluR2 for the 
homologous region in hmGlxiRS, since it has been widely reported that zinc binds to various 
cysteine and histidine motifs. Clearly the effect seen by the mGluR2 potentiators was the inverse 
effect seen by zinc however, it was thought that identification of the site of action of this zinc 

20 effect might lead to identification/understanding of the region involved in the potentiator effect. 

After the construction of the cysteine-rich chimera was initiated, a series of 
subsequent chimeric molecules were made to various regions of the transmembrane domains of 
hmGlxiR2 and hmGluR3. The rational for these substitutions was based on the fact that many 
sites of action for agonists or antagonist have been localized on various subtypes of G-protein 

25 coupled receptors (GPCRs) to particular transmembrane domains of these receptors (insert 

references). However, to our knowledge, the site of action of a receptor specific potentiator had 
not been localized to a transmembrane domain. The site of action of a mGluRS-specific 
antagonist, MPEP, was recently localized to three specific residues contained in the metabotropic 
rec€5)tor subtype 5 cDNA, mGluR5a (insert reference) and since the metabotropic receptors 

30 represent a family of closely related GPCRs, it was thought that a transmembrane binding 

domain(s) could be mvolved in the binding of the mGluR2 potentiators. A series of chimeric 
constmcts were subsequently generated that exchanged regions from TMl-3, TM3-5 or a single 
amino acid substitution in TM7 firom hmGluR2 with homologous regions of hmGluR3. 
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Detailed Disclosure of Invention, Protocols, and Applications 

First, it is noted that the disclosure herein, of mutant forms of mGluR2 and 
mGluR3, describes and specifies sequences for human forms of these mutant receptors. 
However, it is appreciated by those skilled in the art that the invention likewise applies to 
5 orthologues of such mutant forms in other species, including but not limited to murine species, in 
particular the rat and mouse, and also for monkey, ape, and other higher primates. Also, 
chimeric forms incorporating the novel sequences are prepared by combining genetic material 
from two or more species, for instance, combining a mutant form of human mGluR2 into a rat 
model genome. 

10 In certain embodiments, the invention provides a mutant polypeptide, where the 

mutant polypeptide comprises substitution of at least one, two or three amino acids at positions 
688, 689 and 735 relative to the wild-type human mGluR2 molecule, and specifically comprising 
any one, two in combination, or all three of the amino acid substitutions indicated below: 

- for mGluR2: leucine for serine at position 688; valine for glycine at position 
15 689; and aspartic acid for asparagine at position 735. 

- for mGluR3: serine for leucine at position 688; glycine for valine at position 
689; and asparagine for aspartic acid at position 735. 

That is, based on the research described in the section above, it was learned that 
the substitutions of the amino acids in wild-type mGluR3 at homologous positions 688, 689 and 

20 735, into mGluR2, substantially aflfected the potentiation of the resxiltant mutant mGluR2 

molecules when said molecules were evaluated in the presence of an mGluR2 modulator that 
appears to bind to an allosteric binding site involving these amino acid positions. It is noted that 
said positions 688 and 689 are recognized to reside in transmembrane region 4 (TM4) of 
mGluR2, and said position 735 is recognized to reside in transmembrane region 5 (IM5) of 

25 mGluR2. While not being bound to a particular theory, it is hypothesized that, for mGluR2, the 
substitution of larger and/or more acidic amino acids at such positions results in a configurational 
change in an allosteric binding site situated in or near TM4 and TM5. It is further hypothesized 
that such configurational change makes binding by certain allosteric modulators at such site 
more difficult, or less effective, resulting in a depotentiation compared to a wild-type mGluR2. 

30 Likewise, while not being boimd to a particular theory, it is postulated that the 

"reverse" substitutions, of the wild-type mGluR2 amino acids at these positions in mGluR3, to 
form mGluR3 mutants, results in potentiation by allosteric modulators at an analogous site on the 
mutant mGluR3. Further, although this disclosure may state that modulators bind to an allosteric 
site associated with the amino acids at the above three positions, it is emphasized that tiiis is 

35 merely based on a theory hypothesized from the evidence at hand. Critically, the novelty and the 
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Utility of the inventioii disclosed herein, particularly the mGluR2 and mGluR3 mutants and 
related methods, are not bound to fliis particular theory. Instead, the invention relies on the 
merits of the discoveries and results presented herein, independently of the ultimate accuracy of 
any of the above hypotheses and theories. 
5 Particularly, this invention provides mutant mGluR2 polypeptide sequences 

selected from the group consisting of SEQ.ID.NOS.: 1-8, and conservatively modified variants of 
such sequences. The data presented herein, particularly in Figure 4 (see below), indicates that 
four single amino acid mutations together provide a substantial decrease in the effect of two 
allosteric modulators. However, it is believed that the three mutations noted above are the 

10 mutations with the greater effect relative to the fourth mutation, A733T. This belief is based on 
certain data (not shown) that appears to indicate that the A733T mutation (threonine replacing 
alanine at position 733) by itself provides an effect similar to that of a wild-type control. In 
contrast, the testing of the other single mutations by themselves (see Fig. 3 for N735D, other data 
not shown) indicates a greater difference from the wild-type control. 

15 In other embodiments, the invention provides an isolated nucleic acid or a nucleic 

acid compound that comprises a nucleic acid sequence which encodes for a mutant polypeptide, 
where the mutant polypeptide comprises a substitution of any one, two in combination, or all 
three of the amino acid substitutions indicated below: 

for mGluR2: leucine for serine at position 688; valine for glycine at position 689; 

20 and aspartic acid for asparagine at position 735. 

for mGluR3: serine for leucine at position 688; glycine for valine at position 
689; and asparagine for aspartic acid at position 735. 

Particularly, this invention provides isolated nucleic acid sequences selected from 
the group consisting of SEQ.ID.NOS.:9-16, and conservatively modified variants of such 

25 sequences. 

In the above description of the mutations of mGluR3, for convenience of this 
disclosure, the same nmnbering used for mGluR2 is used for the mGluR3. It is noted that there 
is an offset of nine amino acids in the numbering. For example, where a substitution at position 
688 in mGluR3 is noted above, the actual position on the mGluR3 molecule is at position 697. 
30 Skilled artisans will recognize that the polypeptide, peptides, and fusion proteins 

of the present invention can be synthesized by any number of different methods. The amino acid 
compoimds of the invention can be made by chemical methods well known in the art, including 
solid phase peptide synfliesis or recombinant methods. Both methods are described in U.S. Pat. 
No. 4,617,149, incorporated herein by reference. 
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The principles of solid phase chemical synthesis of polypeptides are well known 
in the art and may be found in general texts in the area. See. e.g., H. Dugas and C. Penney, 
Bioorganic Chemistry (1981) Springer-Verlag, New York, 54-92. For example, peptides may be 
synthesized by solid-phase methodology utilizing an Applied Biosystems 430A peptide 
5 synthesizer (commercially available from Applied Biosystems, Foster City Calif.) and synthesis 
cycles supplied by Applied Biosystems. Protected amino acids, such as t-butoxycarbonyl- 
protected ammo acids, and other reagents are commercially available from many chemical 
supply houses. 

Sequential t-butoxycarbonyl chemistry using double couple protocols are applied 
10 to the starting p-methyl benzhydryl amine resins for the production of C-terminal carboxamides. 
For the production of C-terminal acids, the corresponding pyridine-2-aldoxime methiodide resin 
is used. Asparagine, glutamme, and arginine are coupled using preformed hydroxy benzotriazole 
esters. The following side chain protection may be used: 
Arg, Tosyl 
15 Asp, cyclohexyl 

Glu, cyclohexyl 
Ser, Benzyl 
Thr, Benzyl 

Tyr, 4-bromo carbobenzoxy 

20 

Removal of the t-butoxycarbonyl moiety (deprotection) may be accomplished 
with trifluoroacetic acid (TFA) in methylene chloride. Following completion of the synthesis the 
peptides may be deprotected and cleaved from the resin with anhydrous hydrogen fluoride 
containing 10% meta-cresol. Cleavage of the side chain protecting group(s) and of the peptide 

25 from the resin is carried out at zero degrees Celcius or below, preferably -20''C for thirty minutes 
followed by thirty minutes at 0*^C. 

After removal of the hydrogen fluoride, the peptide/resin is washed witb ether, 
and the peptide extracted with glacial acetic acid and then lyophilized. Purification is 
accomplished by size-exclusion chromatography on a Sephadex G-10 (Pharmacia) column in 

30 10% acetic acid. 

The proteins of the present invention altematively are produced by recombinant 
methods. Recombinant methods are preferred if a high yield is desired. A general method for 
the construction of any desired DNA sequence is provided in J, Brown, et al.. Methods in 
Enzymology, 68:109 (1979), See also, J. Sambrook, et al., supra. 

35 The basic steps in the recombinant production of desired proteins are: 
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a) construction of a natural, synthetic or semi-synthetic DNA encoding the 
protein of interest; 

b) integrating said DNA into an expression vector in a manner suitable for 
the expression of the protein of interest, either alone or as a fusion protein; 

5 c) transforming an appropriate eukaryotic or prokaiyotic host cell with said 

expression vector, 

d) culturing said transformed or transfected host cell in a manner to express 
flie protein of interest; and 

e) recovering and purifying the recombinantly produced protein of interest. 

10 

In general, prokaryotes are used for cloning of DNA sequences and constructing 
the vectors of the present invention. Prokaryotes may also be employed in the production of the 
protein of interest. For example, the Escherichia coli K12 strain 294 (ATCC No. 3 1446) is 
useful for the prokaryotic expression of foreign proteins. Other strains of E. coli (and their 

15 relevant genotypes) are well known and conmionly used in the art. 

Further, in addition strains of E. coli, bacilli such as Bacillus subtilis, other 
enterobacteriaceae such as Salmonella typhimurium or Serratia marcescans, and various 
Pseudomonas species may be used. In addition to these gram-negative bacteria, other bacteria, 
especially Streptomyces, spp., may be employed in the prokaryotic cloning and expression of the 

20 proteins of this invention. 

Thus, suitable B. coli strains, as well as many other suitable prokaryote species 
and strains are known in the art, and are commercially available from suppliers such as: Bethesda 
Research Laboratories, Gaithersburg, Md. 20877 and Stratagene Cloning Systems, La Jolla, 
Calif. 92037; or are readily available to the public from sources such as tiie American Type 

25 Culture Collection, 12301 Parklawn Drive, Rockville, Md., 10852-1776. U.S. Patents 6,017,697 
and 6,387,655 disclose additional information regarding suitable species for use, and are 
incorporated by reference generally and for this specific purpose. 

Except where otherwise noted or recognized in the art, the bacterial strains can be 
used interchangeably. Any particular type of bacterial host stated herein is not meant to limit the 

30 invention in any way. Genotype designations are in accordance with standard nomenclature. 
See, for example, J. Sambrook, et al., supra. 

Promoters suitable for use with prokaryotic hosts include the p-lactamase [vector 
pGX2907 (ATCC 39344) contains the replicon and p-lactamase gene] and lactose promoter 
systems [Chang et al., Nature (London), 275:615 (1978); and Goeddel et al.. Nature (London), 

35 281:544 (1979)], alkaline phosphatase, the tryptophan (trp) promoter system [vector pATHl 
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(ATCC 37695) is designed to facilitate expression of an open reading jframe as a trpE fixsion 
protein under control of the tip promoter] and hybrid promoters such as the tac promoter 
(isolatable from plasmid pDR540 ATCC-37282). However, other functional bacterial 
promoters, whose nucleotide sequences are generally known, enable one of skill in the art to 
5 ligate them to DNA encoding the proteins of the instant invention using linkers or adapters to 
supply any required restriction sites. Promoters for use in bacterial systems also will contain a 
Shine-Dalgamo sequence operably linked to the DNA encoding the desired polypeptides. These 
examples are illxistrative rather than linuting. 

The proteins of this invention may be synthesized either by direct expression or as 

10 a fusion protein comprising the protein of interest as a translational fusion with anotiier protein 
or peptide which may be removable by enzymatic or chemical cleavage. It is often observed in 
the production of certain peptides in recombinant systems that expression as a fusion protein 
prolongs the lifespan, increases the yield of the desired peptide, or provides a convenient means 
of purifying the protein of interest. A variety of peptidases (e.g. enterokinase and thrombin) 

15 which cleave a polypeptide at specific sites or digest the peptides from the amino or carboxy 
termini (e.g. diaminopeptidase) of the peptide chain are known. Furthermore, particular 
chemicals (e.g. cyanogen bromide) will cleave a polypeptide chain at specific sites. The skilled 
artisan will appreciate the modifications necessary to the amino acid sequence (and synthetic or 
semi-synthetic coding sequence if recombinant means are employed) to incorporate site-specific 

20 internal cleavage sites. See e.g., P. Carter, "Site Specific Proteolysis of Fusion Proteins", Chapter 
13 in Protein Purification: From Molecular Mechanisms to Large Scale Processes, American 
Chemical Society, Washington, D.C. (1990). 

In addition to cloning and e2q)ressing tiie genes of interest in the prokaryotic 
systems discussed above, the protems of the present invention may also be produced in 

25 eukaryotic systems. The present invention is not limited to use in a particular eukaryotic host 
cell. A variety of eukaryotic host cells are available from depositories such as the American 
Type Culture Collection (ATCC) and are suitable for use with the vectors of the present 
invention. The choice of a particular host cell depends to some extent on the particular 
expression vector used to drive expression of the human glutamate receptor-encoding nucleic 

30 acids of the present invention. 

A preferred host cell line employed in this invention is the widely available cell 
line HEK293 (hereinafter referred to as "HEK293" or "293"). This cell line is available from the 
American Type Culture Collection imder the accession number CRL-1573. 

Cell lines, such as HEK293, produce glutamate endogenously. As a result, some 

35 amounts of glutamate are secreted into the culture medium, thereby making it somewhat difQcult 
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to express and study glutamate receptors in such cell lines due to the activation of the transfected 
receptor. However, measures can be taken to reduce the effects of this glutamate release on the 
functional characterization of the receptor. These include but are not limited to the use of 
neurotransport protem. For example, cell lines can be produced to express a plasmid in which 
5 the rat glutamate transporter gene (GLAST) is expressed. The glutamate levels in 24 hour 

meditmi of a cell line expressing GLAST will reduce the amount of glutamate in the medium to 
less than 3 micromolar, thus reducing the basal activation of the receptor and/or desensitation or 
the requirement for extensive washing to remove residual glutamate before assay procedures. 
See Storck, et al, Proc. Natl Acad. Sci. USA, 89:10955-59 (Nov. 1992) and Desai et al, 

10 Molecular Pharmacology, 48:648-657 (1995). Other measures also are helpful in dealing with 
the endogenous glutamate production. For instance, plating the cells on fibronectin coated plates 
and exchanging out the media 16 hours before the assay is performed are helpful in reducing and 
normalizing the efiFects of the release of glutamate to the media. 

In addition to the lipid transfection method for incorporating the novel nucleic 

15 acid sequences (see Example 2 below), a wide variety of vectors, some of which are discussed 
below, exist for the transformation of manmialian host cells such as those described above. 
Thus, the present invention also relates to vectors which comprise a polynucleotide or 
polynucleotides of the present invention, and host cells which are genetically engineered with 
vectors of the invention and to the production of polypeptides of the invention by recombinant 

20 techniques. In another embodiment, the present invention relates to a recombinant DNA 

molecule comprising, 5' to 3', a promoter effective to initiate transcription in a host cell and the 
above-described invention nucleic acid molecule(s). 

Cell-free translation systems can also be employed to produce such proteins using 
RNAs derived from the DNA constructs of the present invention. 

25 Incorporation of cloned DNA into a suitable e}q)ression vector, transfection of 

eukaryotic cells with a plasmid vector or a combination of plasmid vectors, each encodmg one or 
more distinct genes or with linear DNA, and selection of transfected cells are well known m the 
art (see, e.g., Sambrook et al. (1989) Molecular Clomng: A Laboratory Manual, Second Edition, 
Cold Spring Harbor Laboratory Press). Suitable means for introducing (transducing) expression 

30 vectors containing invention nucleic acid constructs into host cells to produce transduced 

recombinant cells (i.e., cells containing recombinant heterologous nucleic acid) are well known 
in the art (see, for review, Friedmann, 1989, Science, 244:1275-1281; MulHgan, 1993, Science, 
260:926-932, each of which are incorporated herein by reference in their entirety). 

As refeired to above, exemplary methods of transduction include, e.g., infection 

35 employing viral vectors (see, e.g., U.S. Pat. No. 4,405,712 and 4,650,764), calcium phosphate 
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transfection (U.S. Pat. Nos. 4,399,216 and 4,634,665), dextran sulfate transfection, 
electroporation, lipofection (see, e.g., U.S. Pat. Nos. 4,394,448 and 4,619,794), cytofection, 
particle bead bombardment, and the like. The heterologous nucleic acid can optionally include 
sequences which allow for its extrachromosomal (i.e., episomal) maintenance, or the 
heterologous nucleic acid can be donor nucleic acid that integrates mto the genome of the host. 
Recombinant cells can then be cultured under conditions whereby the mutant mGluR2 and 
mGluRB sequences encoded by the DNA is (are) expressed. Preferred cells include TnaTnmaHaTi 
cells (e.g., HEK 293, CHO and Ltk- cells), yeast cells (e.g., methylotrophic yeast cells, such as 
Pichia pastoris), bacterial cells (e.g., Escherichia coli), and the like. 

Expression vectors for use in carrying out the present invention will comprise a 
promoter capable of directing the transcription of a cloned DNA and a transcriptional terminator. 

Also contained in the expression vectors is a polyadenylation signal located 
downstream of the coding sequence of interest. Polyadenylation signals include the early or late 
polyadenylation signals from SV40 (Kaxifitnan and Sharp, ibid.), the polyadenylation signal from 
the Adenovirus 5 ElB region and the human growth hormone gene terminator (DeNoto et al., 
Nuc. Acid Res. 9: 3719-3730, 1981). The expression vectors may include a noncoding viral 
leader sequence, such as the Adenovirus 2 tripartite leader, located between the promoter and the 
RNA splice sites. Preferred vectors may also include enhancer sequences, such as the SV40 
enhancer and the mouse |x enhancer (Gillies, Cell 33: 717-728, 1983). Expression vectors may 
also include sequences encoding the adenovirus VA RNAs. 

Suitable expression vectors are well known in the art, and include vectors capable 
of expressing DNA operatively linked to a regulatory sequence, such as a promoter region that is 
capable of regulating expression of such DNA. Thus, an expression vector refers to a 
recombinant DNA or RNA construct, such as a plasmid, a phage, recombinant virus or olher 
vector that, upon introduction into an appropriate host cell, results in expression of the inserted 
DNA. Appropriate e;q)ression vectors are well known to those of skill in the art and include 
those that aire replicable in eukaryotic cells and/or prokaryotic cells and those that remain 
episomal or those which integrate into the host cell genome. 

Exemplary expression vectors for transformation of E. coli prokaryotic cells 
include the pET expression vectors (Novagen, Madison, Wis., see U.S. Pat. No. 4,952,496), e.g., 
pETUa, which contains the T7 promoter, T7 terminator, the inducible E. coli lac operator, and the 
lac repressor gene; and pET 12a-c, which contains the T7 promoter, T7 terminator, and the E. 
coli ompT secretion signal. Another such vector is the pIN-niompA2 (see Duffaud et al., Meth. 
in Enzymology, 153:492-507, 1987), which contains the Ipp promoter, the laclJV5 promoter 
operator, the ompA secretion signal, and the lac repressor gene. 
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Exemplary eukaryotic expression vectors include eukaryotic cassettes, such as the 
pSV-2 gpt system (Mulligan et al., 1979, Nature, 277:108-1 14); the Okayama-Berg system (Mol. 
Cell BioL, 2:161-170), and the e:?q)ression-cloning vector described by Genetics Institute (1985, 
Science, 228:810-815). Each of these plasmid vectors is capable of promoting expression of the 
5 invention chimeric protein of interests 

Also provided are nucleic acid molecule(s) comprising a transcriptional region 
functional in a cell, a sequence complmientary to an RNA sequence encoding an amino acid 
sequence corresponding to the herein-disclosed mutant mGluR2 and mGluR3 sequences, and a 
transcriptional termination region functional in a suitable host cell. 

10 A wide variety of transcriptional and translational regulatory sequences may be 

employed, depending upon the nature of the host. The transcriptional and translational 
regulatory signals may be derived from viral sources, such as adenovirus, bovine papilloma 
virus, cytomegalovirus, simian virus, or the like, where the regulatory signals are associated with 
a particular gene sequence which has a high level of expression. Alternatively, promoters from 

15 mammalian expression products, such as actin, collagen, myosin, and the like, may be employed. 
Transcriptional initiation regulatory signals may be selected which allow for repression or 
activation, so that expression of the gene sequences can be modulated. Of interest are regulatory 
signals which are temperature-sensitive so that by varying the temperature, expression can be 
repressed or mitiated, or are subject to chemical (such as metabolite) regulation. A favored 

20 promoter is the promoter from the Cytomegalovirus (CMV). 

Thus, an embodiment provides are transformed host cells that recombinantly 
express the herein disclosed mutant mGluR2 and mGluRS sequences of the invention. 

As used herein, a cell is said to be "altered to express a desired peptide" when tihie 
cell, through genetic manipulation, is made to produce a protein which it normally does not 

25 produce or which the cell normally produces at lower levels. One skilled in the art can readily 
adapt procedures for introducing and expressing either genomic, cDNA, or synthetic sequences 
into either eukaryotic or prokaryotic cells. 

A nucleic acid molecule, such as DNA, is said to be "capable of expressing" a 
polypeptide if it contains nucleotide sequences which contain transcriptional and translational 

30 regulatory information and such sequences are "operably linked" to nucleotide sequences which 
encode the polypeptide. An operable linkage is a linkage in which the regulatory DNA 
sequences and the DNA sequence sought to be expressed are connected in such a way as to 
permit gene sequence expression. The precise nature of the regulatory regions needed for gene 
sequence expression may vary from organism to organism, but shall in general include a 

35 promoter region which, in prokaryotes, contains both the promoter (which directs the initiation 
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of RNA transcription) as well as the DNA sequences which, when transcribed into RNA, will 
signal synthesis initiation. Such regions will normally include those 5 -non-coding sequences 
involved with initiation of transcription and translation, such as the TATA box, capping 
sequence, CAAT sequence, and the like. 
5 If desired, the non-coding region 3* to the sequence encoding an mGluR2 or 

mGluR3 gene may be obtained by the above-described mefliods. This region may be retained for 
its transcriptional termination regulatory sequences, such as termination and polyadenylation. 
Thus, by retaining the 3 -region naturally contiguous to the DNA sequence encoding a mutant 
mGluR2 and mGluR3 sequences, the transcriptional termination signals may be provided. 

10 Where the transcriptional termination signals are not satisfactorily functional in the expression 
host cell, then a 3' region functional in the host cell may be substituted. 

Two DNA sequencers (such as a promoter region sequence and an a mutant 
mGluR2 or mGluR3 encoding sequence) are said to be "operably linked" if the nature of the 
linkage between the two DNA sequences does not (1) result in the introduction of a jBrame-shift 

15 mutation, (2) interfere with the ability of the promoter region sequence to direct the transcription 
of the mutant mGluR2 or mGluR3 encoding gene sequence, or (3) interfere with the ability of 
the mutant mGluR2 or mGluR3 encoding sequence to be transcribed by the promoter region 
sequence. Thus, a promoter region would be operably linked to a DNA sequence if the promoter 
were capable of effecting transcription of that DNA sequence. 

20 The selection of control sequences, expression vectors, transformation methods, 

and the like, are dependent on the type of host cell used to express the gene. As used herein, 
"cell", "cell line", and "cell culture" may be used interchangeably and all such designations 
include progeny. 

The term "transformants" or "transformed cells" include the primary subject cell 
25 and cultures derived therefrom, without regard to the number of transfers. It is also understood 
that all progeny may not be precisely identical in DNA content, due to deliberate or inadvertent 
mutations. However, as defined, mutant progeny have the same functionality as that of the 
originally transformed cell. 

To express the human mutant mGlxiR2 and mGluR3 sequences-encoding gene of 
30 the invention, transcriptional and translational signals recognized by an appropriate host are 

necessary. The present invention encompasses the expression of the mutant mGluR2 or mGluR3 
sequences in either prokaryotic or eukaryotic cells. 

Host cells which may be used in the expression systems of the present invention 
are not strictly limited, provided that they are suitable for use in the expression of flie novel 
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human mutant mGluR2 and mGluRS sequences of the invention. Suitable hosts may often 
include eukaryotic cells, such as HEK-293 and other suitable mammalian cell lines. 

Also, as noted above, representative examples of appropriate host cells for use in 
practicmg the present invention include bacterial cells, such as streptococci, staphylococci, E. 
5 coli, Streptomyces and Bacillus subtilis cells. Also, fungal cells, such as yeast cells and 

Aspergillus cells; insect cells such as Drosophila S2 and Spodoptera S& cells; animal cells such 
as CHO, COS, HeLa, C127, 3T3, BHK, HEK 293 and Bowes melanoma cells; and plant cells 
may be used. 

Fungal cells, including species of yeast (e.g., Saccharomyces spp., particularly S. 

10 cerevisiae, Schizosaccharomyces spp.) or filamentous fimgi (e.g., Aspergillus spp., Neurospora 
spp.) may be used as host cells within the present invention. Suitable yeast vectors for use in the 
present invention mclude YRp7 (Struhl et aL, Proc. Natl. Acad. Sci. USA. 76: 1035-1039, 1978), 
YEpl3 (Broach et al.. Gene 8: 121-133, 1979), POT vectors (Kawasaki et al, U.S. Pat No. 
4,931,373, which is incorporated by reference herein), pJDB249 and pJDB219 (Beggs, Nature 

15 275:104-108, 1978) and derivatives thereof Such vectors will generally include a selectable 
marker, which may be one of any number of genes that exhibit a dominant phenotype for which 
a phenotypic assay exists to enable transformants to be selected. Preferred selectable markers 
are those that complement host cell auxotrophy, provide antibiotic resistance or enable a cell to 
utilize specific carbon sources, and include LEU2 (Broach et al., ibid.), URA3 (Botstein et al., 

20 Gene 8: 17, 1979), HIS3 (Struhl et al., ibid.) or POTl (Kawasaki et al., ibid.). Another suitable 
selectable marker is the CAT gene, which confers chloramphenicol resistance on yeast cells. 

Any of a series of yeast gene sequence expression systems can be utilized which 
incorporate promoter and termination elements firom the actively expressed gene sequences 
coding for glycolytic enzymes are produced in large quantities when yeast are grown in mediums 

25 rich in glucose. Known glycolytic gene sequences can also provide very efficient transcriptional 
control signals. 

Yeast provides substantial advantages in that it can also carry out post- 
translational peptide modifications. A number of recombinant DNA strategies exist which 
utilize strong promoter sequences and high copy number of plasmids which can be utilized for 
30 production of the desired proteins in yeast. Yeast recognizes leader sequences on cloned 

mammalian gene sequence products and secretes peptides bearing leader sequences (i.e., pro- 
peptides). For a mammalian host, several possible vector systems are available for the 
expression of the mutant mGluR2 and mGluR3 sequences. 

A variety of higher eukaryotic cells may serve as host cells for expression of the 
35 polypeptides of the invention, although not all cell Imes will be capable of functional coupling of 
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the receptor to the cell's second messenger systems. Cultured mammalian cells, such as BHK, 
CHO, Yl (Shapiro et al., TIPS Suppl. 43-46 (1989)), NG108-15 (Dawson et al., Neuroscience 
Approached Through Cell Culture, Vol. 2, pages 89-1 14 (1989)), NlE-1 15 (Liles et al., J. Biol. 
Chem, 261:5307-5313 (1986)), PC 12 and COS-1 (ATCC CRL 1650) are preferred. Preferred 
5 BHK cell lines are the tlcsup.-- tsl3 BHK cell line (Waechter and Baserga, Proc. Natl. Acad. Sci. 
USA 79:1 106-1 1 10 (1982)) and the BHK 570 cell line (deposited with the American Type 
Culture Collection, 12301 Parldawn Dr., Rockville, Md. imder accession number CRL 103 14). 
A tik.sup.- BHK cell Ime is available from the ATCC xmder accession nxmiber CRL 1632. 

Prokaryotic hosts are, generally, very efficient and convenient for the production 

10 of recombinant proteins and are, therefore, one type of preferred expression system for the 
expressing the mutant mGluR2 and mGluR3 sequences encoding gene. 

Prokaryotes most frequently are represented by various strains of E. coli. 
However, other microbial strains may also be used, including other bacterial strains. In 
prokaryotic systems, plasmid vectors that contain replication sites and control sequences derived 

1 5 from a species compatible with the host may be used. Examples of suitable plasmid vectors may 
include pBR322, pUC-1 18, pUCl 19 and the like; suitable phage or bacteriophage vectors may 
include .gamma.gtlO, .gamma.gtl 1 and the like; and suitable virus vectors may include pMAM- 
neo, pKRC and the like. Preferably, the selected vector of the present invention has the capacity 
to replicate in the selected host cell. 

20 Recognized prokaryotic hosts include bacteria such as E. coli, Bacillus, 

Streptomyces, Pseudomonas, Salmonella, Serratia, and the like. However, under such 
conditions, the peptide will not be glycosylated. The prokaryotic host must be compatible with 
the replicon and control sequences in the expression plasmid. 

To e;q>ress the mutant mGluR2 and mGluR3 sequences (or a ftmctional derivative 

25 thereof) in a prokaryotic cell, it is necessary to operably link tiie mutant mGluR2 or mGluR3 
encoding nucleotide sequence to a functional prokaryotic promoter. Such promoters may be 
either constitutive or, more preferably, regulatable (i.e., inducible or derepressible). Examples of 
constitutive promoters and inducible promoters of well known to a skilled artisan. Prokaryotic 
promoters are reviewed by Cenatiempo (Biochimie 68:505-516 (1986)); and Gottesman (Ann. 

30 Rev. Genet. 18:415-442 (1984) ). Proper expression in a prokaryotic cell also requires the 
presence of a ribosome-binding site upstream of the gene sequence-encoding sequence. Such 
ribosome binding sites are disclosed, for example, by Gold et at.. Aim. Rev. Microbiol. 35:365- 
404(1981). 

As used herein, the term "promoter" refers to a polynucleotide sequence, 
35 preferably a DNA sequence, that regulates e3q>ression of a selected DNA sequence operably 
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linked to the promoter, and which effects expression of the selected DNA sequence in cells. The 
term encompasses "tissue specific" promoters, i.e. promoters, which effect expression of the 
selected DNA sequence only in specific cells (e.g. cells of a specific tissue). The term also 
covers so-called "leaky" promoters, which regulate expression of a selected DNA primarily in 
5 one tissue, but cause expression in other tissues as well. The term also encompasses non-tissue 
specific promoters and promotCTS that constitutively express or that are inducible (i.e. expression 
levels can be controlled). 

A mutant mGl\iR2 and mGluR3 sequences encoding nucleic acid molecule and an 
operably linked promoter may be introduced into a recipient prokaryotic or eukaryotic cell either 

10 as a nonreplicating DNA (or RNA) molecule, which may either be a linear molecule or, more 
preferably, a closed covalent circular molecule. Since such molecules are incapable of 
autonomous replication, the expression of the gene may occur through the transient egression of 
the introduced sequence. Alternatively, permanent expression may occur through the integration 
of the introduced DNA sequence into the host chromosome, 

15 In one embodiment, a vector is employed which is capable of integrating the 

desired gene sequences into the host cell chromosome. Cells which have stably integrated the 
introduced DNA into their chromosomes can be selected by also introducing one or more 
markers which allow for selection of host cells which contain the expression vector. The marker 
may provide for prototrophy to an auxotrophic host, biocide resistance, e.g., antibiotics, or heavy 

20 metals, such as copper, or the like. The selectable marker gene sequence can either be directly 
linked to the DNA gene sequences to be expressed, or introduced into the same cell by co- 
transfection. Additional elements may also be needed for optimal synthesis of single chain 
binding protein mRNA. These elements may include splice signals, as well as transcription 
promoters, enhancers, and termination signals. cDNA expression vectors incoiporating such 

25 elements include those described by Okayama, Molec. Cell. Biol. 3:280 (1983). 

In a preferred embodiment, the introduced nucleic acid molecule will be 
incorporated into a plasmid or viral vector capable of autonomous replication in the recipient 
host. Any of a wide variety of vectors may be employed for this purpose. Factors of importance 
in selecting a particular plasndd or viral vector include: the ease with which recipient cells that 

30 contain the vector may be recognized and selected from those recipient cells which do not 

contain the vector; the number of copies of the vector which are desired in a particular host; and 
whether it is desirable to be able to "shuttle" the vector between host cells of different species. 

Preferred prokaryotic vectors include plasmids such as those capable of 
replication in E. coli (such as, for example, pBR322, ColEl, pSClOl, pACYC 184, .pi.VX. Such 

35 plasmids are, for example, disclosed by Sambrook (cf. Molecular Cloning: A Laboratory 
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Manual, second edition, edited by Sambrook, Fritsch, & Maniatis, Cold Spring Harbor 
Laboratory, (1989)). Bacillus plasmids include pC194,pC221,pT127, and tlie like. Such 
plasmids are disclosed by Gryczan (In: The Molecular Biology of the Bacitli, Academic Press, 
N.Y. (1982), pp. 307-329). Suitable Streptomyces plasmids include pi JlOl (Kendall et al., J. 
5 Bacteriol. 169:4177-4183 (1987)), and streptomyces bacteriophages such as .phi.C31 (Chater et 
al.. In: Sixth International Symposiimi on Actinomycetales Biology, Akademiai Kaido, 
Budapest, Hungary (1986), pp. 45-54). Pseudomonas plasmids are reviewed by John et al. (Rev, 
Infect Dis. 8:693-704 (1986)), and Izaki (Jpn, J. Bacteriol. 33:729-742 (1978)), 

As noted, supra, e7q>ression of the mutant mGluR2 and mGluR3 sequences in 

10 eukaryotic hosts requires the use of eukaryotic regulatory regions. Such regions will, in general, 
include a promoter region suflBcient to direct the initiation of RNA synthesis. Preferred 
eukaryotic promoters include, for example, the C5^omegalovirus Promoter (CMV), the promoter 
of the mouse metallothionein I gene sequence (Hamer et al., J. Mol. Appl. Gen. 1:273-288 
(1982)); the TK promoter of Herpes virus (McKnight, CeU 31:355-365 (1982)); the SV40 early 

15 promoter (Benoist et aL, Nature (London) 290:304-3 10(1981)); the yeast gal4 gene sequence 
promoter (Johnston et al., Proc. Natl. Acad. Sci. (USA) 79:6971-7975 (1982); Silver et al., Proc. 
Natl. Acad. Sci. (USA) 81:5951-5955 (1984)). 

As is widely known, translation of eukaryotic mRNA is initiated at the colon 
which encodes the first methionine. For this reason, it is preferable to ensure that the linkage 

20 between a eukaryotic promoter and a DNA sequence which encodes the mutant mGliiR2 and 
mGluR3 sequences (or a functional derivative thereof) does not contain any intervening codons 
which are capable of encoding a methionine (i.e., AUG). The presence of such codons results 
eitiier in a formation of a fusion protein (if the AUG codon is in the same reading frame as the 
mutant mGluR2 or mGluRS coding sequence) or a frame-shift mutation (if the AUG codon is not 

25 in the same readmg frame as the mutant mGluR2 or mGluR3 coding sequence). 

Preferred eukaryotic plasmids include, for example, BPV, vaccinia, SV40, 2- 
micron circle, and the like, or their derivatives. Such plasmids are well known in the art 
(Botstein et al., Miami Wntr. Symp. 19:265-274 (1982); Broach, In: The Molecular Biology of 
the Yeast Saccharomyces: Life Cycle and Inheritance, Cold Spring Harbor Laboratory, Cold 

30 Spring Harbor, N.Y., p. 445-470 (1981); Broach, Cell 28:203^204 (1982); BoUon et at, J. Ctin. 
Hematol. Oncol. 10:39-48 (1980); Maniatis, In: Cell Biology: A Comprehensive Treatise, Vol. 3, 
Gene Sequence Expression, Academic Press, N.Y., pp. 563-608 (1980), 

Once the vector or nucleic acid molecule containing the construct(s) has been 
prepared for expression, the DNA construct(s) may be introduced into an appropriate host cell by 

35 any of a variety of suitable means, i.e., transformation, transfection, conjugation, protoplast 

-26- 



wo 2004/024936 



PCT/US2003/028470 



fusion, electroporation, particle gun technology, calciimi phosphate-precipitation, direct 
microinjection, and the like. After the introduction of the vector, recipient cells are grown in a 
selective medium, which selects for the growth of vector-containing cells. Expression of the 
cloned gene molecule(s) results in the production of the herem-disclosed mutant mGluR2 and 
5 mGluR3 sequences or biologically active jfragments thereof. This can take place in the 

transformed cells as such, or following the induction of these cells to differentiate (for example, 
by adnndnistration of bromodeoxyuracil to neuroblastoma cells or the like). 

A variety of incubation conditions can be used to form the peptide of the present 
invention. Preferred conditions include those which mimic physiological conditions. 

10 An example of the means for preparing the mutant mGluR2 and mGluRS 

sequences of the invention is to express nucleic acids encoding the mutant mGluR2 and mGluR3 
sequences in a suitable host cell, such as a bacterial cell, a yeast cell, an amphibian cell (i.e., 
oocyte), or a mammalian cell, using methods well known in the art, and recovering the expressed 
polypeptide, again using well-known methods. 

15 Using methods such as northern blot or slot blot analysis, transfected cells that 

contain mutant mGluR2 and mGluR3 sequences encoding DNA or RNA can be selected. 
Transfected cells can also be analyzed to identify those that express the mutant mGl\iR2 and 
mGluR3 sequences. Analysis can be carried out, for example, by using any of well known 
screening assays attending a functional receptor, and comparing the values obtained to a control, 

20 untransfected host cells by electrophysiologically monitoring the currents through the cell 

membrane in response to mutant mGluR2 and mGlxiR3 sequences, and the like. Mutant mGluR2 
and mGluR3 sequences(s) can be isolated directly fix)m cells that have been transformed with 
expression vectors comprising nucleic acid encoding the mutant mGluR2 and mGluR3 
sequences or fragments/portions thereof. 

25 Nucleic acid molecules may be stably incorporated into cells or may be 

transiently introduced using methods known in the art. Stably transfected mammalian cells may 
be prepared by transfecting cells with an expression vector comprising a sequence of nucleotides 
that encodes the mutant mGluR2 and mGluR3 sequences in conjunction with a selectable marker 
gene (such as, for example, the gene for thymidine kinase, dihydrofolate reductase, neomycin 

30 resistance, and the like), and growing the transfected cells imder conditions selective for cells 
expressing the marker gene. To prepare transient transfectants, mammalian cells are transfected 
with a reporter gene (such as the E. coli P-galactosidase gene) to monitor transfection efficiency. 
The precise amounts and ratios of DNA encoding the mutant mGluR2 and mGluR3 sequences 
may be empirically determined and optimized for a particular cells and assay conditions. 

35 Selectable marker genes are typically not included in the transient transfections because the 
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transfectants are typically not grown under selective conditions, and are usually analyzed within 
a few days after transfection. 

In order to identify cells that have integrated the cloned DNA, a selectable marker 
is generally introduced into the cells along with the gene or cDNA of interest. Preferred 
S selectable markers for use in cultured mammalian cells include genes that confer resistance to 
dmgs, such as neomycin, hygromycin, and methotrexate. The selectable marker may be an 
amplifiable selectable marker. Preferred amplifiable selectable markers are the DHFR gene and 
the neomycin resistance gene. Selectable markers are reviewed by Thilly (Mammalian Cell 
Technology, Butterworfh Publishers, Stoneham, MA, which is incorporated herein by reference). 

1 0 The choice of selectable markers is well within the level of ordinary skill in the art. 

Selectable markers may be introduced into the cell on a separate plasmid at the 
same time as the gene of interest, or they may be introduced on the same plasmid. If on the same 
plasmid, the selectable marker and the gene of interest may be under the control of dififerent 
promoters or the same promoter, the latter arrangement producing a dicistronic message. 

15 Constructs of this type are known in the art (for example, Levinson and Simonsen, U.S. Pat. No. 
4,713,339). It may also be advantageous to add additional DNA, known as "carrier DNA" to the 
mixture which is introduced into the cells. 

In particularly preferred aspects, eukaryotic cells which contain heterologous 
DNAs express such DNA and form recombinant mutant mGluR2 and mGluR3 sequences. In 

20 more preferred aspects, recombinant mutant mGluR2 and mGluR3 sequences activity is readily 
detectable because it is a type that is absent from the untransfected host cell. 

Heterologous DNA may be maintained in the cell as an episomal element or may 
be integrated into chromosomal DNA of the cell. The resulting recombinant cells may then be 
cultured or subciiltured (or passaged, in the case of mammalian cells) from such a culture or a 

25 subculture thereof. Methods for transfection, injection and culturing recombinant cells are 

known to the skilled artisan. Sinwlarly, ttie mutant mGluR2 and mGluRS sequences(s) may be 
purified using protein purification methods known to those of skill in the art. For example, 
antibodies or other ligands that specifically bind to the mutant mGluR2 and mGluRS sequences 
may be used for affinity purification of the mutant mGluR2 and mGluR3 sequences. 

30 As used herein, "heterologous or foreign DNA and/or RNA" are used 

interchangeably and refer to DNA or RNA that does not occur naturally as part of the genome of 
the cell in which it is present or to DNA or RNA which is found in a location or locations in the 
genome that differ from that in which it occurs in nature. Typically, heterologous or foreign 
DNA and RNA refer to DNA or RNA that is not endogenous to the host cell and has been 
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artificially introduced into the cell. Examples of heterologous DNA include DNA disclosed 
herein. 

In other embodiments, mRNA may be produced by in vitro transcription of DNA 
encoding the mutant mGluR2 and mGluR3 sequences. This mRNA can then be injected into 
5 Xenopus oocytes where the RNA directs the synthesis of the mutant mGluR2 and mGluR3 

sequences. Alternatively, the mvention-encoding DNA can be directly injected into oocytes for 
e>q)ression of a functional mutant mGluR2 and mGluR3 sequences. The transfected mammalian 
cells or injected oocytes may then be used in the methods of drag screening provided herein. 

Alternatively, the invention DNA sequences can be transcribed into RNA, which 

10 can then be transfected into amphibian cells for translation into protein. Suitable amphibian cells 
include Xenopus oocytes. 

Practitioners of this invention realize that, in addition to the above-mentioned 
expression systems, the cloned cDNA may also be employed in the production of transgenic 
animals in which a test mammal, usually a mouse, in which the effects of the expression of the 

15 proteins of the present invention can be assessed, particularly witii regard to the interaction of 
such proteins and modulators or portions of modulators. For instance, without being limited by 
such example, the nucleic acids of the present invention may be employed in the construction of 
"knockin'* animals in which the expression of a mutant mGluR2 or mGluR3 is expressed. Once 
such test animals are prepared, evaluations are conducted that evaluate the effects of modulator 

20 compoxmds. For instance, a modulator development program, based on rational drug design 

using SAR , may have a putative highly selective modulator design. Structural variations of this 
design, and the hypothetical optimal design, each are evaluated in a n\maiber of different types of 
knock-in mice, where each knock-in mouse type cairies a different mutation combination of the 
S688L, G689V and N735D mutants. This data is used to verify, refine the design of, and 

25 evaluate tiie putative highly selective modulator in a whole organism model. An important 
feature of such evaluations is the ability to evaluate and assess "oflF-targef * effects of the 
modulators, i.e., effects at other organs, or unexpected behavioral effects. 

Skilled artisans also recognize that some alterations of SEQ.ID.NOS.:l-8 will fail 
to change the fimction of the respective amino acid compound. For instance, some hydrophobic 

30 amino acids may be exchanged for other hydrophobic amino acids. Those altered amino acid 
compounds which confer substantially the same function in substantially the same manner as the 
exemplified amino acid compound are also encompassed within the present invention. Typically 
such conservative substitutions attempt to preserve the: (a) secondary or tertiary structure of the 
polypeptide backbone; (b) the charge or hydrophobicity of the residue; or (c) the bulk of the side 

35 chain. Some examples of such conservative substitutions of amino acids, resisting in the 
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production of proteins which may be functional equivalents of the protein of SEQ.ID.NOS.: 1-8 
are shown in TABLE H, infra. 



TABLE n 




Original Residue 


Exemplaiy Substitutions 


Ala 


Ser, Gly 


Arg 


Lys 


Asn 


Gin, His 


Asp 


Glu 


Cys 


Ser 


Gin 


Asn 


Glu 


Asp 


Gly 


Pro, Ala 


His 


Asn, Gin 


He 


Leu, Val 


Leu 


lie, Val 


Lys 


Arg, Gin, Glu 


Mel 


Leu, He 


Phe 


Met, Leu, Gyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


Ttp, Phe 


Val 


lie. Leu 



25 

These substitutions may be introduced into the protein in a variety of ways, such 
as during the chemical synthesis or by chemical modification of an amino acid side chain after 
the protein has been prepared. 

Alterations of the protein having a sequence which corresponds to the sequence of 

30 SEQ.ID.NOS.: 1-8 also may be induced by alterations of the nucleic acid compounds which 

encode these proteins, i.e., SEQ.ID.NOS.:9-16. These mutations of the nucleic acid compound 
may be generated by either random mutagenesis techniques, such as those techniques employing 
chemical mutagens, or by site-specific mutagenesis employing oligonucleotides. Those nucleic 
acid compounds which confer substantially the same function in substantially the same manner 

35 as the exemplified nucleic acid compoimds are also encompassed within die present invention. 
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Other embodiments of the present invention are nucleic acid compounds which 
comprise isolated nucleic acid sequences which encode any of the polypeptide sequences of 
SEQ.ID.NOS.:l-8. As skilled artisans will recognize, the amino acid compounds of the 
invention can be encoded by a multitude of different nucleic acid sequences because most of the 
amino acids are encoded by more than one nucleic acid triplet due to the degeneracy of the 
amino acid code. Because these alternative nucleic acid sequences would encode the same 
axnxao acid sequences, the present invention further comprises these alternate nucleic acid 
sequences. 

The nucleic acid sequences encoding the mutant mGluRjc receptor molecules of 
the present invention may be produced using syntiietic methodology. This syntiiesis of nucleic 
acids is well known in the art. See. e.g., E. L. Brown, R. Belagaje, M. J. Ryan, and H. G. 
Khorana, Methods in Enzymology, 68:109-151 (1979). The DNA segments corresponding to 
the receptor gene are generated using conventional DNA synthesizing apparatus such as the 
Applied Biosystems Model 380A or 380B DNA synthesizers (commercially available from 
Applied Biosystems, Inc., 850 Lincohi Center Drive, Foster City, Calif. 94404) which employ 
phosphoramidite chemistry. In the alternative, the more traditional phosphotriester chemistry 
may be employed to synthesize the nucleic acids of this invention. [(See. e.g., M. J. Gait, ed.. 
Oligonucleotide Synthesis, A Practical Approach, (1984).] 

Any of such synthetic hxmian mGluRjc nucleic acid sequences (e.g., synthetic 
genes) may be designed to possess restriction endonuclease cleavage sites at either end of the 
transcript to facilitate isolation from and integration into expression and amplification plasmids. 
The choice of restriction sites are chosen so as to properly orient the coding sequence of the 
receptor with control sequences to achieve proper in-fiame reading and expression of the human 
mGluRx receptor molecule. A variety of other such cleavage sites may be incorporated 
depending on the particular plasmid constructs employed and may be generated by techniques 
well known m the art. 

In an alternative methodology, tiie desired DNA sequences can be generated xising 
the polymerase chain reaction as described in U.S. Pat. No. 4,889,818, which is incorporated 
herein by reference. 

In addition to the deoxyribonucleic acid of SEQ.ID.NOS.:9-16, this invention also 
provides ribonucleic acids (RNA) that Ukewise express the mGluR2 mutants designated as 
SEQ.ID.NOS.: 1-8, and conservatively modified variants thereof. As is known to those of skill in 
the art, the DNA sequences provided herein are convertible to corresponding RNA sequences via 
pyrimidine base substitution of uracil (U) for thymine (T). 



-31- 



wo 2004/024936 



PCT/US2003/028470 



The ribonucleic acids of the present invention may be prepared using the 
polynucleotide synthetic methods discussed supra or they may be prepared enzymatically using 
RNA polymerases to transcribe a DNA template. 

One system for preparing the ribonucleic acids of the present invention employs 
5 the RNA polymerase from the bacteriophage T7 or the bacteriophage SP6. Both of these KNA 
polymerases are highly specific and require the insertion of bacteriophage-specific sequences at 
the 5' end of the message to be read. See, J. Sambrook, et al., supra, at 18.82-18.84. 

This invention also provides nucleic acids, RNA or DNA, which are 
complementary to SEQ.ID.NOS.:9-16. 

10 The present invention also provides probes and primers useful for molecular 

biology techniques. A compound which encodes for SEQ.ID.NOS.:9-16, or a complementary 
sequence of SEQ.ID.NOS.: 9-16, or a fragment thereof, and which is at least 20-50 base pairs in 
length, and which will selectively hybridize to human genomic DNA or messenger RNA 
encoding a human glutamate receptor, is provided. Preferably, the 25 or more base pair 

15 compound is DNA or a length sufficient to hybridize. 

The term "selectively hybridize" as used herein may refer to either of two 
situations. In the first such embodiment of this invention, the nucleic acid compounds described 
supra hybridize to a human glutamate receptor tmder more stringent hybridization conditions 
than these same nucleic acid compoimds would hybridize to an analogous glutamate receptor of 

20 another species, e.g. rodent. In the second such embodiment of this invention, these probes 
hybridize to the mutant mGluR2 or mGluR3 receptor molecules under more stringent 
hybridization conditions than other related compounds, including nucleic acid sequences 
encoding other glutamate receptors of the same species. A more detailed explanation of 
hybridization and vaiying stringency of hybridization, which pertain to the two types of 

25 hybridization described here, is provided in the "Definitions and Terms of Art" section supra. 

These probes and primers can be prepared enzymatically as described supra. 
These probes and primers also are synthesized using chemical means as described supra. 
Alternately, these probes and primers of defined stmcture may also be purchased commercially. 

Another aspect of the present invention is recombinant DNA cloning vectors and 

30 expression vectors comprising the nucleic acid sequences of the present invention. Many of the 
vectors encompassed within this invention are described above. The preferred nucleic acid 
vectors are those which are DNA. A preferred recombinant DNA vector comprises the isolated 
DNA sequence SEQ.ID.NO.:14, bearing the codons for all three single amino acid substitutions 
on the respective mutant mGluR2 sequence. This sequence has been shown to provide the 

35 strongest depotentiation. However, it is fully appreciated that when using the compositions of 
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the present invention to help identify desired structures and sequences for effective modulators 
of mGluR2, the isolated nucleic acid sequences encoding a mutant mGluR2 with a single amino 
acid substitution, e.g., SEQ.ID.NO.:8-10, and the sequences encoding a mutant mGluR2 with 
two amino acid substitutions, e.g., SEQ,ID.NO.: 11-13, are valuable when used in concert. Any 
5 of these are suitable for transfection into cells by use of vectors, as described herein, and by other 
means of transfection, such as by lipid transfection as in Example 2 below. 

The skilled artisan understands that the type of cloning vector or expression 
vector employed depends upon the availability of appropriate restriction sites, the type of host 
cell in which the vector is to be transfected or transformed, the purpose of the transfection or 

10 transformation (e.g., transient expression in an oocyte system, stable transformation as an 

extrachromosomal element, or integration into the host chromosome), the presence or absence of 
readily assayable markers (e.g., antibiotic resistance markers, metabolic markers, or &e like), 
and the number of copies of the gene to be present in the cell. 

The type of vector employed to carry the nucleic acids of the present invention 

15 may be RNA vimses, DNA viruses, lytic bacteriophages, lysogenic bacteriophages, stable 
bacteriophages, plasmids, viroids, and the like. The most preferred vectors of the present 
invention are those derived from plasmids. 

When preparing an expression vector the skilled artisan imderstands that there are 
many variables to be considered. One such example is the use of a constitutive promoter, i.e. a 

20 promoter which is functional at all times, instead of a regulatable promoter which may be 

activated or inactivated by the artisan using heat, addition or removal of a nutrient, addition of an 
antibiotic, and the like. The practitioner also understands that the amount of nucleic acid or 
protein to be produced dictates, in part, the selection of the expression system. For experiments 
examining the amount of the protein expressed on the cell membrane or for experiments 

25 examining the biological function of an expressed membrane protein, for example, it may be 

unwise to employ an expression system which produces too much of the protein. The addition or 
subtraction of certain sequences, such as a signal sequence preceding the coding sequence, may 
be employed by the practitioner to influence localization of the resulting polypeptide. Such 
sequences added to or removed from the nucleic acid compounds of the present invention are 

30 encompassed within this invention. 

The plasmid pcDNA3.1 can be readily modified to construct expression vectors 
that produce mutant mGluR2 and mGluRS receptors of the present invention in a variety of cells, 
including, for example, HEK-293. 

One of the most widely employed techniques for altering a nucleic acid sequence 

35 is by way of oligonucleotide-directed site-specific mutagenesis. B. Comack, "Current Protocols 
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in Molecular Biology", 8.01-8.5.9, (F. Ausubel, et al., eds. 1991). In this technique an 
oligonucleotide, whose sequence contains the mutation of interest, is synthesized as described 
supra. This oligonucleotide is then hybridized to a template containing the wild-type sequence. 
In a most preferred embodiment of this technique, the template is a single-stranded template. 
5 Particularly preferred are plasmids which contain regions such as the fl intergenic region. This 
region allows the generation of single-stranded templates when a helper phage is added to the 
culture harboring the "phagemid". 

After the annealing of the oligonucleotide to the template, a DNA-dependent 
DNA polymerase is then used to synthesize the second strand from the oliognucleotide, 

10 complementary to the template DNA. The resulting product is a heteroduplex molecule 

containing a mismatch due to the mutation in the oligonucleotide. After DNA replication by the 
host cell a mixture of two types of plasmid are present, tiie wild-type and the newly constructed 
mutant. This technique permits the introduction of convenient restriction sites such that the 
coding sequence may be placed immediately adjacent to whichever transcriptional or 

15 translational regulatory elements are employed by the practitioner. 

The construction protocols utilized for E. coli can be followed to construct 
analogous vectors for other organisms, merely by substituting, if necessary, the appropriate 
regulatory elements using techniques well known to skilled artisans. Also, construction 
protocols for eukaryotic cells are widely known and employed by those skilled in the art. 

20 Host cells which harbor the nucleic acids provided by the present invention are 

also provided. A preferred host cell is an Xenopus sp. oocyte which has been injected with RNA 
or DNA compounds of the present invention. Most preferred oocytes of the present invention 
are those which harbor a sense mRNA of tiie present invention. Other preferred host cells 
include HEK-293 cells which have been transfected and/or transformed with a vector which 

25 connprises a nucleic acid of the present invention. 

The present invention also provides a method for constructing a recombinant host 
cell capable of expressing any of SEQ.ID.NOS.:9-16, said method comprismg transforming a 
host cell with a recombinant DNA vector that comprises an isolated DNA sequence which 
encodes any of SEQ.ID.NOS.:9-16. The preferred host cell is HEK-293. A preferred vector for 

30 expression is one which comprises SEQ.ID.N0.:16. A preferred host cell for this method is 
HEK-293. An especially preferred expression vector in HEK-293 is one which comprises 
SEQ.ID.NO.:16. 

Transformed host cells may be cultured under conditions well known to skilled 
artisans such tiiat any of SEQ.ID.NOS.:9-16 is expressed, thereby producing a mutant of 
35 mGluR2 m the recombinant host cell, where such mutant alters the potentiation compared to 
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wild type mGluR2 with respect to modulators that normally bind to at least one allosteric binding 
site associated with the fliree amino acid substitutions believed to provide the most substantial 
alteration in effect. 

Likewise, transformed host cells may be cultured under conditions well known to 
skilled artisans such that any of SEQ.ID.NOS.:41-48 is expressed, thereby producing a mutant of 
mGluR3 in the recombinant host cell, where such mutant sltets the potentiation compared to 
wild type mGluR3 with respect to modulators that normally bind to at least one allosteric binding 
site associated with the three amino acid substitutions believed to provide the most substantial 
alteration in effect. Also with regard to mGluR3, mutant peptides of SEQ.ID.NOS..:33-40 are 
produced and used in manners disclosed herein for mutant mGluR2 peptides. With regard to the 
use of such mGIuRB mutant peptides and the nucleic acid sequences aicoding them, it is 
believed that the polypeptide configuration of at least some of such nmtants interacts with an 
allosteric modulator having its site in the general TM4/TM5 region in such a way as to increase 
the effect of such modulator. 

The ability of glutamate to bind to an mGluRx receptor is essential in the 
development of a multitude of indications. In identifying and developing substances that act as 
allosteric modulators of a particular mGluIU receptor, such as mGluR2, it would be desirable, 
therefore, to determine those substances that bind to the allosteric binding site described herein. 
Generally, such an assay includes a method for determining whether a substance non- 
competitively affects the activity of the designated mGluRx receptor, and more specifically such 
an assay compares the effect of said substance on glutamate reception in assay systems using 
wild type mGluRx, and at least one of the mutant mGluRxs of the present invention. Typically a 
control of wild-type mGluRxr with a known amount of glutamate, i.e., 1 mM, serves as a positive 
control against which other responses are normalized. For instance, a purported up-modulator of 
mGluR2 that acts at the allosteric binding site discovered herein is expected to raise the response 
with a small glutamate quantity (e.g.. 3 or 10 pM) in wild-type mGluR2, but is expected to have 
a diminished tqp-modulation effect in a treatment havmg a mutant mGluR2 where tiie mutant has 
at least one of any of the following amino acid substitutions: S688L; G689V; and N735D. This 
difference in modulation can be observed by comparing the effect of modulator Compounds A 
and B in Figure 1 with the lower effect of such modulators in Figures 2-6. 

The instant invention provides such a screenmg system useful for discovering 
agents which allosterically modulate glutamate binding to the mGluR2 (or, alternately, the 
mGluR3) receptor, said screeniag system comprising the steps of: 

a) preparing a human mutant mGluR2 mutant receptor, as with any of the 
sequences SEQ.ID.NOS.:9-16, and obtaining or preparing a wild-type human mGluR2 receptor; 
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b) transforming cells to express, separately, the mutant and wild-type 
mGluR2 receptors prepared in "a)"; 

c) introducing glutamate at desired levels in each treatment and control; 

d) exposing said human mGluR2 mutant and wild-type receptors (expressed 
5 per the transformations in "b)" above) to a potential allosteric modulator of the mGluR2 receptor 

complex (where said potential allosteric modulator may bind to the site(s) affected by the 
mutation in the mutant mGluR2); and, 

e) quantifying the relative degree of responses among the treatments such as 
compared to a positive control provided a higher level of glutamate. 

10 This allows one to rapidly screen for allosteric modulators of the 

glutamate/HmGluR2 receptor complex. This screening approach also is applied to mGluR3. 
Utilization of the screening system described above provides a sensitive and rapid means to 
determine compounds which modulate mGluR2 but not mGluR3 to the same extent, or vice 
versa. For example, a preferred modulator identified with the present invention modulates 
15 mGluR2 at a pharmacologically effective dose, but its impact on modulation of mGluR3 is 
small, for instance, an up-modulation of less than 20 percent, whereas the up-modulation for 
mGluR2 by said modulator is over 100 percent change under the same test conditions. This 
increase in specificity afforded by the present invention is seen to result in decrease toxicity by 
increasing targeted effectiveness at lower doses and eliminating undesired stimulation of non- 
20 targeted mGluRx receptors. 

Another use of such mutant forms of mGl\iR2 and mGlxiRS is to use as "negative 
controls" in studies that evaluate whether or not a modulator binds to the corresponding wild- 
type mGluR2 or mGluR3. In such studies, strong binding to mGluR2 wild-type receptors and 
substantially less or no binding to mGluR2 mutants at the 688, 689, and/or 735 sites indicates 
25 that said modulator has good specificity to an allosteric modulator site on wild-lype mGluR2 
(where the mutations disrupt the association that provides for the modulator's effect). It further 
suggests that there would not be cross-over effect to modulate mGluR3 at a corresponding site 
on mGluR3, 

The following table provides a sxmunary of desired levels of changes in the 
30 targeted mGluRc compared to the mGluIU that is most affected by the modulator but which is 
not the targeted subtype: 
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For a "positive'* modulator 


- approx. 




For a "negative" modulator 


- approx. 


percent change compared to no 




percent change compared to no 


modulator 








modulator 






Levels of 
Modulator 
Specificity 
(arbitrary scale) 


Targeted 
mGluRx 


Non-targeted 
mGluRx most 
affected by 
modulator 




Levels of 
Modulator 
Specificity 
(arbitrary scale) 


Targeted mOluRx 


Non-targeted 
mGluRx most 
affected by 
modulator 


1 


+ 25% 


+15% 




1 


- 25% 


-15% 


2 


+25% 


+10% 




2 


-25% 


-10% 


3 


+50% 


+15% 




3 


-50% 


-15% 


4 


+50% 


+10% 




4 


-50% 


-10% 


5 


+100% 


+10% 




5 


-100% 


-10% 


6 


+200% 


+15% 




6 


-200% 


-15% 


7 


+500% 


+25% 




7 


-500% 


-25% 


8 


>500% 


<50% 




8 


<500% 


>50% 



For example, considering a negative modulator targeted for mGluR2, where 
mGluR3 is the non-targeted mGluRx with the highest response of all non-targeted, identified 
5 mOluRxs, a level 4 response ("good") is when the modulator provides about a 50% decrease in 
glutamate response for mGluR2, and only about a 10% decrease in glutamate response for 
mGluR3. A better profile for a modulator, in that it is more specific, is a level 5 response, where 
the modulator provides about a 100% decrease in glutamate response for mGluR2, and about flie 
same 10% decrease in glutamate response for mGluR3. 

Screening systems, such as based on the above method, may also be adapted to 
automated procedures such as a PANDEX.RTM. (Baxter-Dade Diagnostics) system allowing 
for efBcient high-volume screening of potential allosteric modulator therapeutic agents. 

It is also noted that an oocyte transient egression system can be constructed 
according to the procedure described in S. Lubbert, et al. Proceedings of the National Academy 
15 of Sciences (USA), 84:4332 (1987). 

In an especially preferred embodiment of this invention an assay measuring the 
production of phosphoinositides was performed. The production of phosphoinositides is known 
to positively correlated with the addition of glutamate to cells containing certain types of 
metabotropic receptors that are coupled to Gq or to Gs or Gi Imked metabotropic GPCRs that are 
20 co-expressed with various promisceous g-proteins that now couple these Gs or Gi linked 
receptors to Gq-linked second messenger systems. 
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In an especially preferred embodiment of this invention an assay measuring the 
inhibition of forskolin-stimulated cAMP synthesis was performed. The inhibition of cAMP 
synthesis is known to positively correlated with the addition of glutamate to cells containing 
certain types of metabotropic receptors. 
5 In anotiier embodunoat this invention provides a method for identifying, in a test 

sample, DNA homologous to a probe of the present invention, wherein the test nucleic acid is 
contacted with the probe undar hybridizing conditions and identified as being homologous to the 
probe. Hybridization techniques are well known in the art See, e.g., J. Sambrook, et al., supra, 
at Chapter 11. 

10 The nucleic acid compounds of the present invention may also be used to 

hybridize to genomic DNA which has been digested with one or more restriction enzymes and 
run on an electrophoretic gel. The hybridization of radiolabeled probes onto such restricted 
DNA, usually fixed to a membrane after electrophoresis, is well known in the art. See, e.g., J. 
Sambrook, supra. Such procedures may be employed m searching for persons with mutations in 

1 5 these receptors by the well-known techniques of restriction fragment length polymorphisms 

(RPLP), tiie procedures of which are described in U.S. Pat. No. 4,666,828, issued May 19, 1987, 
the entire contents of which is incorporated herein by reference. 

The determination of the DNA base sequence of the hxmian genome is considered 
to have a major impact on biomedical science in the next century. It is believed that such 

20 determinations will enhance a range of applications from genetic mapping of disease-associated 
genes to diagnostic tests for disease susceptibility and drug response. Briefly, the determination 
of base composition at specific, variable DNA sites known as single nucleotide polymorphisms 
(SNPs) is especially important SNPs have a number of uses in mapping, disease gene 
identification, and diagnostic assays. All of these applications involve flie detennination of bases 

25 composition at the SNP site. 

Conventional techniques to determine base conq)osition at a single site include 
minisequencing (See, e.g., "Minisequencing: A Specific Tool For DNA Analysis And 
Diagnostics On Oligonucleotide Arrays," by Tomi Pastinen et al.. Genome Research 7, 606 
(1997)), and oligo-ligation (See, e.g., "Single-Well Genotypmg Of DialleUc Sequence Variations 

30 By A Two-Color ELISA-Based Oligonucleotide Ligation Assay," by Vincent O. Tobe et al.. 
Nuclear Acids Res. 24, 3728 (1996)). In minisequencing, a primer is designed to interrogate a 
specific site on a sample template, and polymerase is used to extend the primer with a labeled 
dideoxynucleotide. In oligo-ligation, a similar primer is designed, and ligase is used to 
covalentiy attach a downstream oligo that is variable at the site of interest In each case, ttie 

35 preference of an enzyme for correctly base-paured substrates is used to discriminate the base 
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identity that is revealed by the covalent attachment of a label to the primer. In most applications 
these assays are configured with the primer immobilized on a solid substrate, including 
microplates, magnetic beads and recently, oligonucleotides microarrayed on microscope slides. 
Detection strategies include direct labeling with fluorescence detection or indirect labeling using 
5 biotin and a labeled streptavidin with fluorescent, chemiluminescent, or absorbance detection. 

Oligonucleotide microarrays or "DNA chips" have also generated much attention 
for their potential for massively parallel analysis. The prospect of sequencing tens of thousands 
of bases of a smiall sample in just a few minutes is exciting. At present, tiiis technology has 
limited availability in that arrays to sequence only a handful of geines are currently available, 

10 with substantial hardware and consumable costs. In addition, the general approach of 

sequencing by hybridization is not particularly robust, with the requirement of significant 
sequence-dependent optimization of hybridization conditions. Nonetheless, the parallelism of an 
"array" technology is very powerful and multiplexed sequence determination is an important 
element of the new flow cj^ometry method. 

15 Other methods for SNP analysis include those in U.S. Patent No. 6,287,766. 

Briefly, a DNA sequence variation can be analyzed using the techniques outlined 
in the above referenced patent. In general, differences exist in similar DNA regions isolated 
from two individuals of a species. Such differences in DNA sequence are known as genetic 
sequence variation. Genetic sequence variation may result in phenotypic differences in the two 

20 individuals or may have no phenotypic effect whatsoever. Similarly, the genetic sequence 

variation may have a profound ejffect on the host of the different genetic sequence or it may have 
no effect whatsoever. 

Comparisons between two DNA samples can lead to useful genetic information. 
For example, with the various genome projects, reference or control sequences are available to 

25 use for comparison purposes. New DNA samples isolated from similar or dissimilar organisms 
can be compared to the known sequences using the techniques of the invention. Similarly, two 
different unknown samples can be compared. 

In general, a first DNA sample is attached to a solid support such as flow 
cytometry beads. The first DNA sample may be a known sample or an unknown DNA sample. 

30 Next, a test sample of DNA is isolated. The test sample may be a PGR product. The test sample 
is then incubated with the DNA attached to the solid support imder conditions suitable to permit 
DanA hybridization between the two DNA samples. DNA sequence differences are detected as 
DNA mismatches or other "mutations" in the hybrid DNA. 
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SNP Analysis 

Single Nucleotide Polymorphisms (SNP) can be detected using the techniques 
disclosed in flie 6,287,766 patent. Importantly, the SNPs can be known or unknown. 

1. Known SNPs 

Native or wild type DNA is attached to a solid support such as flow cytometry 
beads. Next, the test DNA sample is isolated. The DNA may be amplified by PGR using oligos 
that flank the SNP. The DNA sample is incubated with the native or wild ^e DNA attached to 
the solid support. Finally, the mixture is incubated with a labeled DNA mutation binding 

protein. A detected mismatch is indicative of a SNP. The test sample may be isolated ftom a 
patient. 

In an alternative embodiment, two solid supports are utilized. Native or wild type 
DNA is coupled to one support. DNA containing the known mutation is coupled to the other 
support. A DNA sample is incubated with both solid supports under conditions that allows DNA 
hybridization. Finally, the mixture is incubated with a labeled DNA mutation binding protein If 
the sample DNA has a SNP, the native DNA-support will show a mismatch and the mutant 
DNA-support will show no signal due to the match. If the sample DNA does not have the 
mutation, the native support will show a match and the mutant support will show a mismatch. 

The most efficient soHd support for known SNP analysis is flow cytometry. A 
Ubrary of DNA molecules of interest can be coupled to the beads. Such DNA can be produced 
by PGR. For example, if one is attempting to identify a SNP in a particular gene such as a 
BRCA breast cancer gene, one would PGR a blood sample firom a candidate to amplify the DNA 
using oligos that flank the region of interest. The PGR sample is then annealed to the beads 
containing the BRGA native DNA in sections of 100-200 bases with overlap to ensure SNP 
detection in the entke gene. A DNA mutation binding protein such as thermophilic MutS 
protein which recognizes all DNA mismatches is labeled with a detectable label such as biotin 
and is then incubated with the beads. Streptavidin Phycoerytfarin can be used as a reporter. If 
the DNA mutation binding protein (MutS) protein detects a DNA mutation, it binds to the DNA 
which is bound to the bead forming a DNA mutation binding protein~DNA~bead complex. The 
biotin attached to the DNA mutation-binding protein (MutS) is detected by the reporter using 
flow cytometry. 

The ability to assay 100 beads (each with a different form of DNA attached) per 
sample using flow cytometry in a matter of seconds makes the genomic approach to SNP 
detection feasible. This approach makes it possible to narrow down the SNP to approximately 
100 bases which can then be sequenced to identify the exact nature of the SNP or change in the 
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sequence. The DNA sequencing could be done from the initial PGR reaction used to anneal to 
the beads thus simplifying the SNP detection greatly. 

2. Unknown SNPs 

5 For genome wide search for SNPs, flow cytometry is the preferred form of 

detection. This method proposes attaching wild type DNA (in 200-500 base fragments where 
each fragment overlaps with the next by .aboutSO bases to insure that all DNA is read) to a 
library of beads. Preferably, smaller fragments such 200 bases are used since it is estimated that 
a SNP occurs every 1000 base pairs and it is desirable to narrow down the SNP to limit the 

10 amount of sequencing needed. As above, the patient sample is amplified and annealed to the 
beads containing the wild type DNA sequence. 

One hundred (100) beads (which can read in mutliplex fashion«at the same time 
due to the ability to detect each bead by its fluorescent signature) can be read in one test well. 
Therefore in 96 well format, 9600 sections of 200 bases can be read in less than an hour which 

15 correlates with 1,920,000 bases checked for SNPs on each 96 well plate. Mismatches can be 

detected in minutes thereby reflecting a SNP in the respective 200 base fragment on that bead. If 
that fragment corresponds to a DNA sequence of interest, ftirther sequencing (from the original 
PGR sample used to anneal to the beads) will identify the exact SNP sequence. 

The identity of the herein disclosed nucleic acid molecules will also find use in 

20 the identification of individuals harboring identical or substantially similar sequences in their 
genomes. This, finding, in turn, will aid the practicing physician in identifying those individuals 
at risk of not responding to medicaments aimed at normalizing endogenous glutamate levels 
especially those seeking treatment of mGluR 2 or mGluR3 related disorders characterized with 
low or non-existent levels of endogenoois glutamate, 

25 Thus, a diagnostic method for screening an individual at risk for non-compliance 

with an mGluR2/3 related therapeutic regimen comprises: 

(a) providing a biological sample of human cell/tissue; and 

(b) deteniiiiiing, in the sample, levels of expression of a gene product 
expressed from a nucleotide sequence which hybridizes under stringent conditions with a 

30 nucleotide sequence corresponding to one or more of SEQ.ID,NOS.:9-16, or SEQ.ID,NOS.:41- 
48, or a complement thereof, wherein said step comprises identifying a complex formed betwera 
SBxd gene product and an antibody having affinity for said gene product. 

An alternative method proposes assaying a biological sample from a subject 
35 imdergoing a therapeutic regiment to treat an mGluR2/3 related disorder characterized by 
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abnormal binding of glutamate to its respective receptor, for expression of mRNA of a gene 
comprising a sequence of nucleotides as set forth in SEQ.ID,NOS.:9-16, 25-32 or 41-48 or 
substantially identical thereto or encoding the polypeptide of SEQ.ID.N0S.:l-8, 17-24 or 33-40, 
wherein expression thereof is indicative of the presence of a mutant form of mGluR2 or 3 as 
5 disclosed herein. This finding, in turn, will aid in the identifications of individuals at risk of not 
responding to a therapeutic regiment aimed at correcting/nomializing levels of endogenous 
glutamate. In further thereto, it is believed that individuals having nucleotides sequences 
identical to or substantially similar to SEQ.ID.NOS.:9-16, 25-32 or 41-48 will have lower 
binding of a selective allosteric potentiator which would result in less potentiation by the 

10 endogenous glutamate which would eventually result in less inhibition of the presynaptic 
mGluR2 receptor resulting in increased synaptic activity of the presynaptic terminal. 

The proteins of this invention as well as firagments of these proteins may be used 
as antigens for the synthesis of antibodies. The term "antibody" as used herein describes 
antibodies, fi-agments of antibodies (such as, but not limited, to Fab, Fab', Fab.sub.2 *, and Fv 

15 firagments), and chimeric, humanized, veneered, resurfaced, or CDR-grafl:ed antibodies capable 
of binding antigens of a similar nature as the parent antibody molecule firom which they are 
derived. The instant invention also encompasses single chain polypeptide binding molecules. 

The term "antibody" as used herein is not limited by the manner in which the 
antibodies are produced, whether such production is in situ or not. The term "antibody" as used 

20 in this specification encompasses those antibodies produced by recombinant DNA technology 
means including, but not limited, to expression in bacteria, yeast, insect cell lines, or mammalian 
cell lines. 

The production of antibodies, both monoclonal and polyclonal, in animals, 
especially mice, is well known in the art. See. e.g., C. Miktein, Handbook of Experimental 

25 Immunology, (Blackwell Scientific Pub., 1986); J. Coding, Monoclonal Antibodies: Principles 
and Practice, (Academic Press, 1983). For the production of monoclonal antibodies the basic 
process begms with injectuag a mouse, or other suitable animal, with an immunogen. The mouse 
is subsequently sacrificed and cells taken fi:om its spleen are fiised with myeloma cells, resulting 
in a hybridoma that reproduces in vitro. The population of hybridomas is screened to isolate 

30 individual clones, each of which secretes a single antibody species, specific for the immunogen. 
The individual antibody species obtamed in this way is each the product of a single B cell fi-om 
the immxme animal generated in response to a specific antigenic site, or epitope, recognized on 
the immunogenic substance. 

Chimeric antibodies are described in U.S. Pat. No. 4,816,567, which issued Mar. 

35 28, 1989 to S. Cabilly, et al. This reference discloses methods and vectors for the preparation of 
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chimeric antibodies. The entire contents of U.S. Pat. No. 4,816,567 are incorporated herein by 
reference. An alternative approach to production of genetically engineered antibodies is 
provided in U.S. Pat No. 4,816,397, which also issued Mar. 28, 1989 to M. Boss, et al., the 
entire contents of which are incorporated herein by reference. The Boss patent teaches the 
5 simultaneous co-e;qpression of the heavy and light chains of the antibody in the same host cell. 

The approach of U.S. Pat. No. 4,816,397 has been further refined as taught in 
European Patent Publication No. 0 239 400, which published Sep. 30, 1987. The teachings of 
this European patent publication (Winter) are a preferred format for the genetic engineering of 
the reactive monoclonal antibodies of this invention. The Winter technology involves the 
10 replacement of complementarity determining regions (CDRS) of a human antibody with the 

CDRs of a murine monoclonal antibody thereby converting flie specificity of the human antibody 
to the specificity of the murine antibody which was the source of the CDR regions. This "CDR 
grafting" technology affords a molecule containing miniinal murine sequence and thus is less 
inmnmogenic. 

1 5 Single chain antibody technology is yet another variety of genetically engineered 

antibody which is now well known in the art. See. e.g. R. E. Bird, et al., Science 242:423-426 
(1988); PCT Publication No. WO 88/01649, which was published Mar. 10, 1988. The single 
chain antibody technology involves joining the binding regions of heavy and light chains with a 
polypeptide sequence to generate a single polypeptide having the binding specificity of the 

20 antibody firom which it was derived. 

The aforementioned genetic engineering approaches provide the skilled artisan 
with numerous means to generate molecules which retain the binding characteristics of the 
parental antibody while affording a less immimogenic format 

These antibodies are used in diagnostics, therapeutics or in diagnostic/therapeutic 

25 combinations. By "diagnostics" as used herein is meant testing that is related to either ttie in 
vitro or in vivo diagnosis of disease states or biological status in mammals, preferably in 
humans. By "therapeutics" and "therapeutic/diagnostic combinations" as used herein is 
respectively meant the treatment or the diagnosis and treatment of disease states or biological 
status by the in vivo administration to mammals, preferably humans, of the antibodies of the 

30 present invention. The antibodies of the present invention are used in the screening methods and 
in rational dmg development, as well as in other applications where the immunological binding 
of an antibody to one of the novel polypeptides, peptides or fiised proteins is desired. 

Nimierous other assay systems are also readily adaptable to detect agents which 
bind the novel polypeptides, peptides or fiised proteins of the present invention. Examples of 

35 these aforementioned assay systems are discussed in Methods in Enzymology, (J. Langone. and 
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H. Vunakis, eds. 1981), Vol. 73, Part B, the contents of which are incorporated herein by 
reference. Skilled artisans are directed to Section n of Methods in Enzymology, Vol. 73, Part B, 
supra, which discusses labeling of antibodies and antigens, and Section IV, which discusses 
inununoassay methods. Also, regarding the development of other antibodies, which are specific 
5 for the hypervariable regions of the mGluRx receptor antibodies, wherein some such anti- 
idiotypic antibodies would resemble the original epitope, and, therefore, would be useful in 
evaluating the effectiveness of compounds which are potential antagonists, agonists, or partial 
agonists of the mGluRx receptor itself (as opposed to the allosteric site), see. e.g., Cleveland, et 
al.. Nature (London), 305:56 (1983); Wasserman, et al.. Proceedings of the National Academy of 
10 Sciences (USA), 79:4810 (1982). 

Several methods for preparing and using the conq)ounds, and applying the 
methods, of this invention are illustrated in the following Examples. Starting materials are made 
according to procedures known in the art or as illustrated herein. 

15 EXAMPLE 1 

Site Directed M utagenesis Using the QuickChange Protocol of StratapeneTM; 

The following simunarizes the procedure that is employed to generate mutant 
forms of mGluR2 and mGIuRS at the specific amino acid locations indicated above. This is 
based on the protocol developed by StratageneTM. 

20 

1. Resuspend all primers in sterile water with a final concentration of lug/ul. 
Make working stocks of each primer by diluting to 125ng/ul. 

2. Prepare test reaction and positive control reaction according to table 
below, using components provided within the kit: 

25 



Reactants: 


Test reaction 


Control 


lOX Buffer 


5ul 


5ul 


Template 20ng/ul 


lul hmGluR2 


lul pWhitescript 


Prima: 1 125ng/ul 


lul hmG2 Primer 1 


1.25 ul primer 1 


Primer 2 125ng/ul 


lul hmG2 Primer 2 


1.25 ul primer 2 


dNTP 


lul 


lul 


Water 


40ul 


38.5ul 


PfoTurbo 


lul 


lul 
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10 



20 



25 



Conditions: 



1. Place samples in PGR thermo cycler and set the following PGR Gycling 



15 reaction. 



2. 
3. 
4. 
5. 
6. 
7. 

8. 

ice for 30 minutes. 

9. 

for 2 minutes. 

10. 
11. 
12. 
13. 



1 Cycle: 
18 Cycles: 



95<'C 
95°C 
5S°C 
68°C 
4''C 



30 seconds 
30 seconds 
60 seconds 
12 minutes 



Hold: 

Add lul Dpnl «izyme to each reaction and incubate for 1 hour at 37*'C. 
Prepare LB/AMP plates. 

Add lOul .IM IPTG and 50ul 40mg/ml XGal per plate and spread evenly. 
Let plates sit in 37''C-incubator 1 hour before using. 
Thaw competent cells on ice. 

Transfer 50uL of competent cells to pre-chilled falcon tubes, 1 tube per 

Add luL of Dpnl treated reactions to the separate tubes and incubate on 

Heat transformation in 42"'G-water bath for 45 seconds, then cool on ice 

Add 500uL NZY+ broth (42°C). Incubate at 37°G for 1 hour at 225 ipm. 
Plate 250ul of transformations per plate. Incubate overnight at 37'*C. 
Remove plates from 37*C and store at 4°C. 

Pick colonies from each clone and inoculated them into 4ml LB/AMP. 



30 



Incubate overnight at 37''C, 300rpm. 

Then DNA is isolated from overnight cultures using a mini-prep kit (QiagenTM 
or equivalrait) following the kit instructions. Samples of DNA are sequenced by standard 
sequencing protocols to determine if a desired, correct mutation has been incorporated into the 
sequence and no further mutations have been mis-incorporated. 
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EXAMPLE 2 

Lipid Transfection to Introduce Mutant DNA into Cells Using MAGECTM Multi-well Plate- 
based Protocol: 

For mGluR2 and mGluR3 mutants, the mGluR2/Gal6 nucleic acids are mixed 
with a lipid-based transfection reagent and are deposited into a well of a multi-well plate coated 
with fibronectin. The resulting DNA/lipid complexes are subsequently dried in vacuo then 
seeded with HEK293 cells. This is based on a method designated as MAGECTM, and disclosed 
in U.S. Provisional Application No. 60/372,476, filed April 15, 2002, and titled, "Matrix 
Analyses of Gene Expression in Cells." 

The following summarizes the steps of the method: 



1 . In a 15 ml tube, add 0.35 DNA of mGluR2 and Gal6 DNAs to 45 ^il 
of DNA-condensation buffer (Buffer EC from Effectene Kit, Qiagen, Inc.) in which sucrose has 

1 5 been dissolved to a concentration of 0.3 M, 

2. Add 4.5 |Lil of Enhancer solution, mix the contents by pipetting up vortex 
gently then incubate the mixture at room temperature for 5 minutes. 

3. Add 1 1 ^il Effectene transfection reagent; mix the solution with gentle 

vortexing. 

20 4. Incubate at room temperature for 10 minutes. 

5. Add 90 jil 0.25% glycogen, remix the solution and pipette 150 ^il into a 
well of multi-well fibronectin coated plate. 

6. Spread solution to cover the whole bottom of the well and let sit overnight 

at4°C. 

25 7. Dry plate in a SpeedVac or vacuum chamber. Dried plates can be used 

immediately or stored at 4'*C for an extended period. 

8. One day prior to transfection, cells are passaged to maintain growth in log- 
phase. 

9. On the day of transfection, cells are trypsinized and 600,000 cells/well are 
30 subsequently seeded into the previously coated and dried multi-well plate(s) in DMEM media 

containing glutamine and placed at 37''C with 6% C02- 
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10. Approximately 26-30 hours after plating, the media is removed from the 
cells and 1 ml of DMEM media minus glutamine is pipetted over the cells and the plates are 
placed back at 37*^0 with 6% CO2. 

1 1 . Approximately forty ho\irs after plating, the cells are analyzed in the 
phosphoinositide hydrolysis assay. 

EXAMPLES 
Measurement of PhosDhoinosittde Hvdrolvsis 

Phosphoinositide hydrolysis was determined by measuring the accumulation of 
tritiated inositol monophosphate (pHq-IPi) in the presence of LiCL The method is taken from 
Berridge: Biochem J (1983) 212: 849-58. This determination provides for comparative data on 
the level of potentiation and loss of potentiation when testing a purported or known allosteric 
modulator in the presence of glutamate and either a wild-type mGluR2 or mGluR3 and at least 
one mutant of the respective mGluR2 or mGluR3. 

1 . HEK293 cells that were previously transiently transfected with hmGluR2 
and Gal 6 were labeled ovemight with liaCi/well myo-[2-3H]-inositol in a glutamine-free 
DMEM medium. 

2. The following day, the medium was removed and the cells were washed 
two times with HBS buffer containing 125 mM NaCL 5 mM KCl, 0.62 mM MgS04, 1.8 mM 
CaCl2, 6 mM glucose and 20 mM HEPES pH 7.4 for 45 min at 37''C, 

3. Following the wash, cells were then incubated with HBS-buffer 
containing 10 mM LiCL 

4. After 20 min incubation in the LiCl containing buffer, varying 
concentrations of glutamate alone (5 uM (ECio) or 1 mM (ECi00))» or glutamate (5 uM (ECio) 
in combination with varying concentrations of a mGluR2 potentiator were added to the wells and 
incubated for an additional hour at 37''C. 

5. The reactions were terminated by aspiration of the media from the wells 
and the accumulated [3H]-inositol monophosphates were extracted by adding 1 ml of cold 
chloroform-methanol-HCl (4N) (200:100:2). 

6. The mixtures were then transferred to glass tubes containing 300 \il of 
chloroform and 400 jil of H2O and vortexing briefly. 
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7. The aqueous phase was separated from the organic phase, by 
centrifiigation at 4000 rpm for 3 min or the samples can be allowed to settle for 15 mm 

8. A 0.5 ml aliquot of the aqueous phase from each sample was added to an 
anion exchange column containing Dowex-l-X8 (200-300 mesh in the formate form). 

5 9. After the application of the sample to the column, the colunms were 

washed with 10 ml of a solution consistmg of 60 mM ammonium formate and 5 mM borax. 

1 0. [3H]- inositol monophosphates were eluted from the columns by the 
addition of 4 ml of a solution containing 800 mM ammonium formate and 0.1 M formic acid. 

1 1 . The column eluates were transferred to a 20 ml glass scintillation vial and 
10 16 ml of scintillation cocktail (Ecolume, ICN) was added. 

12. The sample was quantified in a scintillation counter after a two-hour 

waiting p^od. 



EXAMPLE 4 



The responses to novel nucleic acid sequences of the present invention were 
evaluated through comparisons with two known mGluR2 modulators. One of these modulators 
is designated herein as Compund A, has the chemical formula N-(3-(2-methoxyphenoxy)- 
phenyl-N-(2,2,2-trifluoroethylsulfonyl)pyrid-3-yhnethylamine, and is described in the PCT 
Patent Application No.: WO 01/56990. The second modulator is designated as Compound B. 

The interactions between the novel mutant forms of mGliiR2 and these 
modulators indicate the value of the novel nucleic acid sequences and polypeptides of the present 
invention for purposes including, but not limited to: 



1 . screening of potential mGluR2 modulators that are not active, or less 
25 active, for mGluRS; 

2. screening of potential mGluR3 modulators that are not active, or less 
active, for mGlxiR2; 

3. rational drug design using SAR to design specific modulators for mGluR2 
that are not active, or less active, for mGluR3; and 

30 4. rational drug design using SAR to design specific modulators for mGluR3 

that are not active, or less active, for mGluR2. 



Figure 1 shows the percent response of several treatments compared to a ImM 
glutamate positive control with wild type mGluR2. This control is normalized and set to "100 
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percent on the y-axis. This figure shows the effect of the two modulators, Compound A and 
Compound B, in up-regulating the response of mGluR2 when provided a relatively low level, 5 
uM of glutamate ("glu"), compared to the 1 mM level of the positive control (far right bar). 
Figure 2 shows the effect of the SG-688-689-LV mutant of mGluR2, 
5 (SEQ.ID.lIrO,:5), when the same modulators are applied. A substantial decrease of glutamate 
response is evident in comparison to the wild-type mGluR2 results in Figure 1. This indicates 
that the modulators which were effective at up-regulating wild-type mGluR2 in the presence of 5 
uM quantities of glutamate (relative to the 1 mM control) are both less effective at such up- 
modulation when acting on the SG-688-689-LV mutant of mGluR2. 

10 Figure 3 shows the effect of the N735D mutant of mGluR2, (SEQ.ID.NO.: 1), 

when the same modulators are applied. A substantial decrease of glutamate response is evident 
in comparison to the wild-type mGluR2 results in Figure 1. This indicates that the modulators 
which were effective at up-regulating wild-type mGluR2 in the presence of 5 uM quantities of 
glutamate (relative to the 1 mM control) are both less effective at such up-modulation when 

15 acting on the N735D mutant of mGluR2, It also is noted that the loss of effectiveness of these 
modulators is somewhat greater for this mutant than for the mutant of Figure 2, 

Figure 4 shows the effect of four combined point mutations - the SG-688-689-LV 
and AN(733,735)TD mutant of mGluR2, (SEQ,ID.NO.:8) when the same modulators are 
applied. A more substantial decrease of glutamate response is evident in comparison to the wild- 

20 type mGluR2 resxxlts in Figure 1 . This indicates that the modulators which were effective at up- 
regulating wild-type mGluR2 in the presence of 5 uM quantities of glutamate (relative to the 1 
mM control) are both less effective at such up-modulation when acting on the SG-688-689-LV 
and AN(733,735)TD mutant of mGluR2. It also is noted that the loss of effectiveness of these 
modulators is greater for this mutant than for the mutants of Figures 2 and 3. This suggests that 

25 this combination of mutations is more effective at disrapting an allosteric binding site for these 
modulators. 

Figure 5 shows the effect of two combined point mutations - the S-688-L and 
N735D mutant of mGIuR2, (SEQ.ID.NO.:6), when the same modulators are applied. A more 
substantial decrease of glutamate response is evident in comparison to the wild-type mGluR2 
30 results in Figure 1 . This indicates that the modulators which were effective at up-regulating 

wild-type mGluR2 in the presence of 5 xiM quantities of glutamate (relative to the 1 mM control) 
are both less effective at such up-modulation when acting on the S-688-L and N735D mutant of 
mGluR2. It also is noted that the loss of effectiveness of these modulators is greater for this 
mutant than for the mutants of Figures 2 and 3, and comparable to the results of Figure 4. This 
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suggests that this combination of mutations is more effective than the single point mutations, and 
as effective as the four-mutation combination depicted in Figure 4. 

Figure 6 shows the effect of two combined point mutations - the G-689-V and 
N735D mutant of mGluR2, (SEQ.ID.NO.:2), when the same modulators are applied. A more 
substantial decrease of glutamate response is evident in comparison to the wild-type mGluR2 
results m Figure 1. This indicates that the modulators which were effective at up-regulating 
wild-type mGluR2 in the presence of 5 uM quantities of glutamate (relative to the 1 mM control) 
are both less effective at such up-modulation when acting on the G-689-V and N735D mutant of 
mGluR2. It also is noted that the loss of effectiveness of these modulators is greater for this 
mutant than for the mutants of Figures 2 and 3, and comparable to the results in Figure 4. This 
suggests that this combination of mutations is more effective than the single point mutations, and 
as effective as the four-mutation combination depicted in Figure 4. 

Without being boimd to a specific theory, the above sets of results suggest that the 
amino acids at positions 688, 689, and 735 of mGluR2 are important in establishing an allosteric 
binding site for modulators that include the Compound A and Compound B modulators. The 
data indicate that substitution of amino acids in these positions dramatically affect the ability of 
such modulators to act on such mutated mGluR2 receptor molecules. Fxirther, Figures 5 and 6 
suggest that two mutations one at 735 and one at either 688 or 689, is essentially as effective as a 
mutant having four mutations. This adds to the knowledge of what stmcture is critical for the 
binding and activity of Compounds A and B, and similar modulators. This opens opportunities 
to exploit the discovery of these positions as critical to effectiveness of modulators, and to 
develop specific modulators. 

Accordingly, the teachings of the present invention find application in a number 
of areas, some of which are described in this and the following paragraphs. For example, 
without being limiting of the scope of the present mvention, the mutant forms of mGluR2 and 
mGluR3 disclosed herein are incorporated into chimeric nucleic acid sequences of human, rat, 
and mouse mGluR2 and/or mGluR3. The resultant chimeric mutant nucleic acids sequences, and 
the resultant polypeptides, are utilized in screening for modulators, testing of identified 
modulators for specificity, toxicity testing, and rational dmg design and development With 
regard to screening of substances for modulator properties, these mutant chimeric molecules, and 
the resultant polypeptides, are used in high-throughput functional drag screening assays in order 
to identify sub^e-specific modulators, particularly positive modulators, of mGluR2, and of 
mGluR3. 

Further regarding identification of the chimeric mutant nucleic acid sequences and 
polypeptides encoded by said sequences, novel polypeptides, peptides, and fiision proteins are 



-50- 



wo 2004/024936 PCT/US2003/028470 



prepared wherein such polypeptides, peptides, and fusion proteins identify the polypeptides 
encoded by said sequences. Also, novel nucleic acid sequences are prepared that encode these 
novel polypeptides, peptides, and fixsion proteins. Also, expression vectors expressing these 
novel nucleic acid sequences are prepared. 
5 In certain embodiments, these novel polypeptides, peptides, and fusion proteins 

are designed and constructed to specifically identify a unique or rare sequence of a specific 
mutant chimeric polypeptide of the present invention. Alternately, in other embodiments, novel 
polyclonal or monoclonal antibodies are prepared and purified to immunologically bind to said 
chimeric polypeptide. The specificity of such antibodies is selected to meet the purpose of the 

10 antibody/peptide binding requirements for a specific assay or other application. In more 

stringent specificity applications, the antibody is prepared to immxmologically bind to the site of 
interaction of the modulator, e.g., at the site modified by the mutant forms of mGluR2 or 
mGluRS of the present invention. 

In other embodiments, the corresponding sequences in other species are utilized 

15 for assays, comparison and evaluations with in vitro, in vivo cellular, in vivo whole organism 
(e.g., chimeric), and in vivo site-specific use of mutants of the present invention. In any of such 
embodiments, the following sequences are of utility: mutant rat mGluR2 peptides, 
SEQ.ID.NOs.: 17-24; mutant rat mGluR2 DNA nucleic acid sequences, SEQ.ID.NOS.:25-32; 
mutant himian mGluR3 peptides, SEQ.ID.NOS.:33-40; and mutant human mGluRB nucleic acid 

20 sequences, SEQ.ID.NOS..:41-48. 

In addition, the fluree sites of mutations in mGlu2 or mGluRS, and optionally 
nearby relevant amino acids that also are shown to play a role in the allosteric binding of a 
modulator in the TM4 and TM5 regions associated with these three sites of mutations, are 
candidates for Single Nucleotide Polymorphism (SNP) analyses. Such comparative 

25 pharmacogenetics are conducted either in the general population (to obtain baseline data) and/or 
in sub-populations of subjects in need of modulators for prevention or treatment of certain 
neurological conditions, such as schizophrenia and general anxiety disorder. 

The above additional application of the present invention are developed through 
techniques known to those skilled in the art using methods described in the present application 

30 and supported by the information incorporating by reference fi*om the patents, patent 
applications, publications, texts and references discussed or cited herein. 

More generally, all patents, patent applications, publications, texts and references 
discussed or cited herein are incorporated by reference to the same extent as if each individual 
publication or patent application was specifically and individually set forth in its entirety, and 

35 any such incorporation is not limited in any way to the particular aspect or method for which a 
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patent, patent application, publication, text or reference was cited herein. Nothing herein is to be 
construed as an admission that the invention is not entitled to antedate such disclosures by virtue 
of prior invention. In addition, all terms not specifically defined are first taken to have the 
meaning given through usage in this disclosure, and if no such meaning is inferable, their normal 
5 meaning. Where a limitation is described but not given a specific term, a term corresponding to 
such limitation may be taken firom any references, patents, appUcations, and other documents 
cited herem, or, for an appUcation claiming priority to this application, additionally firom an 
Invention Disclosure Statement, Examiner' s Summary of Cited References, or a paper otherwise 
entered into the file history of this {plication. 

Additional references useful in understanding aspects of the present disclosure 
and invention includes: D. D. Schoepp, "Glutamate receptors". Handbook of Receptors and 
Channels, Chapter 13 (S. J. Peroutka, ed., CRC Press, 1984); ESCRIBANO, A. et al., «(2S,4S)- 
2-Amino-4(2,2-Diphenylethyl) Pentanedioic Acid Selective Group 2 Metabotropic Glutamate 
Receptor Antagonist", Bioorg. Med, Chem. Lett., 1998, 765-770:8; HELTON, DAVID R., et al., 

15 "Anxiolytic and Side-Effect Profile of LY354740: A Potent, Highly Selective' Orally Active 
Agonist for Group II Metabotropic Glutamate Receptors", JPET, 1998, 651-760: 284, USA; 

KINGSTON, A.E., et al., "LY341495 is a Nanomolar Potent and Selective ' 
Antagonist of Group H Metabotropic Glutamate Receptors", Neuropharmacology, 1998, 1-12: 
37;MUKHOPADHYAYA,J.K., etal., "Synthesis of Nl -Substituted Analogues of (2R,4R)- 

20 4-Amino-pyTroUdine-2,4-dicarboxylic Acid as Agonists, Partial Agonists, and Antagonists of 
Group n Metabotropic Glutamate Receptors", Bioorg. Med. Chem. Lett., 2001, 1919-1924: 11; 
NAKAZATO, A. et al., "Synthesis, SARs, and Pharmacological Characterization of 2-Amino-3 
or 6-fluorobicyclo[3.1.0] hexane-2,6-dicarboxylic Acid Derivatives as Potent, Selective, and 
Orally Active Qroup H Metabotropic Glutamate Receptor Agonists", J. Med. Chem., 2000, 

25 4893-4909: 43, 25; NAPLES, M.A. & HAMPSON D.R., "Pharmacological Profiles of the 

Metabotropic Glutamate Receptor Ligands [3 H]L-AP4 and [3 H] CPPG", Neuropharmacology, 
2001, 170-177: 40; ORNSTEIN, P.L. et al., "2-Substituted (2SR)-2-Amino-2-((lSR,2 SR)— 
carboxycycloprop-l-yl) glycines as Potent and Selective Antagonists of Group H Metabotropic 
Glutamate Receptors. 1. Effects of Alkyl, Arylalkyl, and Diarylalkyl Substitution". J. Med. 

30 Chem., 1998, 346-357: 41 . 

The present invention is not to be limited in scope by the specific embodiments 
described herein. Indeed, various modifications of the invention in addition to those described 
herein will become apparent to those skilled in the art from the foregoing description. Such 
modifications are intended to fall withm tiie scope of the appended claims. Thus, for the above 
35 variations and in other regards, it should be understood that the examples and embodiments 
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described herein are for illustrative purposes only and that various modifications or changes in 
light thereof will be suggested to persons skilled in the art and are to be included within the spirit 
and purview of this application and the scope of the appended claims. 

Also, although only a few exemplary embodiments of this invention have been 
described in detail above, those skilled in the art will readily appreciate that many modifications 
are possible without undue experimentation based on the methods and citations provided, and 
based on the exemplary embodiments herein, without materially departing firom the novel 
teachings and advantages of this invention. Accordingly, all such modifications are intended to 
be included in the scope of this invention as defined in the following claims. In the claims, 
means-plus-function clauses and step-plus-fimction clauses are intended to cover the stractures 
described herein as effectuating or performing the recited function and to cover not only 
structural equivalents, but also to cover equivalent structures as one of ordinary skill in the art 
would understand equivalence with regard to a any means or any step that will achieve a stated 
function in an equivalent manner. For instance, a "means to detect a modulator of mGluR2" 
should be taken to include methods now or later known to those of skill in the art regardless of 
differences in the exact steps and reagents required to achieve this function. 
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Seq ZD 1: N735D 

5 10 15 20 

Viet Gly Ser Leu Leu Ala Leu Leu Ala Leu Leu Pro Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

His Gin Lys Gly Gly Pro Ala Glu Asp Cys Gly Pro Val Asn Glu His Arg Gly lie Gin 

65 70 75 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Arg Asp Pro His Leu Leu Pro 

85 90 95 100 

Gly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Gly Asp Ala Pro Thr Ala He Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 

165 170 175 180 

Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 

245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Ser Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Gly Ala Ala Glu Gly Ala He Thr He Glu Leu Ala 

325 330 335 340 

Ser Tyr Pro He Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Gin Arg Phe Arg Cys Ser Phe Arg Gin Arg 

365 370 375 ' 380 

Asp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys He Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 
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405 410 415 420 

Pro Asn Thr Thr Arg Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr His Asn Glu 

445 450 , 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly lie Gly Arg Tyr Asn lie Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 

485 490 495 500 

Asp Thr Ser Leu lie Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys lie Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 

565 570 575 580 

lie Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr He Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr He Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe He Phe He Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 

645 650 655 660 

Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg He Ala Arg He 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe He Ser Pro Ala Ser Gin Val 

685 690 695 700 

Ala He Cys Leu Ala Leu He Ser Gly Gin Leu Leu He Val Val Ala Trp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 

725 730 735 740 

Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Ala Tyr Asp Val Leu Leu He Ala 

745 750 755 760 

Leu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe He Gly Phe Thr Met Tyr Thr Thr Cys He He Trp Leu Ala Phe Leu Pro ile Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 

805 810 815 820 
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Ser Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His lie lie Leu Phe Gin 

825 830 835 840 

Pro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Ala 

845 850 855 860 

Arg Ala Ser Ser Ser Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 
Gly Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 



Seq ID 2: G689V N735D 

5 10 15 20 

Met Gly Ser Leu Leu Ala Leu Leu Ala Leu Leu Pro Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

His Gin Lys Gly Gly Pro Ala Glu Asp Cys Gly Pro Val Asn Glu His Arg Gly He Gin 

65 70 75 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Arg Asp Pro His Leu Leu Pro 

85 90 95 100 

Gly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Gly Asp Ala Pro Thr Ala He Tiir Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 

165 170 175 180 

Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 

245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Ser Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Gly Ala Ala Glu Gly Ala He Thr He Glu Leu Ala 
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325 330 335 340 

Ser Tyr Pro lie Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Gin Arg Phe Arg Cys Ser Phe Arg Gin Arg 

365 370 375 380 

Asp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys lie Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 

405 410 415 420 

Pro Asn Thr Thr Arg Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr His Asn Glu 

445 450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly lie Gly Arg Tyr Asn He Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr* Trp Ala Glu Gly Leu Thr Leu 

485 490 495 500 

Asp Thr Ser Leu He Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys He Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 

565 570 575 580 

He Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr He Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

€05 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr He Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe He Phe He Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 

645 650 655 660 

Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg He Ala Arg He 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe He Ser Pro Ala Ser Gin Val 

685 690 695 700 

Ala He Cys Leu Ala Leu He Ser Val Gin Leu Leu He Val Val Ala Trp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 
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725 730 735 740 

Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Ala Tyr Asp Val Leu Leu. lie Ala 

745 750 755 760 

Leu Cys- Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe lie Gly Phe Thr Met Tyr Thr Thr Cys lie He Trp Leu Ala Phe Leu Pro He Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 

805 810 815 820 

Ser Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His He He Leu Phe Gin 

825 830 835 840 

Pro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Ala 

845 850 855 860 

Arg Ala Ser Ser Ser Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 
Gly Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 

Seq ZD 3 s S688Z1, 6689V, N735D 

5 10 15 20 

Met Gly Ser Leu Leu Ala Leu Leu Ala Leu Leu Pro Leu Trp .Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Tlir Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

His Gin Lys Gly Gly Pro Ala Glu Asp Cys Gly Pro Val Asn Glu His Arg Gly He Gin 

65 70 75 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Arg Asp Pro His Leu Leu Pro 

85 90 95 100 

Gly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Gly Asp Ala Pro Thr Ala He Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 

165 170 175 180 

Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 
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245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Ser Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Gly Ala Ala Glu Gly Ala lie Thr lie Glu Leu Ala 

325 330 335 340 

Ser Tyr Pro lie Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Txp Glu Gin Arg Phe Arg Cys Ser Phe Arg Gin Arg 

365 370 375 380 

Asp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys lie Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Taa His Ala Leu His Asn Met His Arg Ala Leu Cys 

405 410 ' 415 420 

Pro Asn Thr Thr Arg Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr His Asn Glu 

445 450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly He Gly Arg Tyr Asn He Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 

485 490 495 500 

Asp Thr Ser Leu He Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys He Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Tarp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 

565 570 575 580 

He Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr He Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr He Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe He Phe lie Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 
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645 650 655 660 

Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg lie Ala Arg lie 

665 670 675 680 

Phe Gly.Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe lie Ser Pro Ala Ser Gin Val 

685 690 695 700 

Ala lie Cys Leu Ala Leu lie Leu Val Gin Leu Leu lie Val Val Ala Trp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 

725 730 735 740 

Cys Asn His Arg Asp Ala - Ser Met Leu Gly Ser Leu Ala Tyr Asp Val Leu Leu lie Ala 

745 750 755 760 

Leu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe He Gly Phe Thr Met Tyr Thr Thr Cys He He Trp Leu Ala Phe Leu Pro He Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 

805 810 815 820 

Ser Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His He He Leu Phe Gin 

825 830 835 840 

Pro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Ala 

845 850 855 860 

Arg Ala Ser Ser Ser Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 
Gly Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 



Seg ID 4 s 8688X1 

5 10 15 20 

Met Gly Ser Leu Leu Ala Leu Leu Ala Leu Leu Pro Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

His Gin Lys Gly Gly Pro Ala Glu Asp Cys Gly Pro Val Asn Glu His Arg Gly He Gin 

65 70 75 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Arg Asp Pro His Leu Leu Pro 

85 90 95 100 

Gly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Gly Asp Ala Pro Thr Ala He Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 
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165 170 175 180 

Pro Gin lie Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu lie Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

lie Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 

245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Ser Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Gly Ala Ala Glu Gly Ala He Thr He Glu Leu Ala 

325 330 335 340 

Ser Tyr Pro He Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 • 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Gin Arg Phe Arg Cys Ser Phe Arg Gin Arg 

365 370 375 380 

Asp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys He Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 

405 410 415 420 

Pro Asn Thr Thr Arg Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 ' 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr His Asn Glu 

445 450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly He Gly Arg Tyr Asn He Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 

485 490 495 500 

Asp Thr Ser Leu He Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys He Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 
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565 570 575 580 

He Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr He Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr He Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe He Phe He Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 

645 650 655 660 

Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg He Ala Arg He 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe He Ser Pro Ala Ser Gin Val 

685 690 695 700 

Ala He Cys Leu Ala Leu He Leu Gly Gin Leu Leu He Val Val Ala Trp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 

725 730 735 740 

Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Ala Tyr Asn Val Leu Leu He Ala 

745 750 755 760 

Leu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe He Gly Phe Thr Met Tyr Thr Thr Cys He He Trp Leu Ala Phe Leu Pro He Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 

805 810 815 820 

Ser Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His He He Leu Phe Gin 

825 830 835 840 

Pro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Ala 

845 850 855 860 

Arg Ala Ser Ser Ser Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 
Gly Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 



Seq ID 5: S688I. G689V 

5 10 15 20 

Met Gly Ser Leu Leu Ala Leu Leu Ala Leu Leu Pro Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

His Gin Lys Gly Gly Pro Ala Glu Asp Cys Gly Pro Val Asn Glu His Arg Gly He Gin 

65 70 75 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Arg Asp Pro His Leu Leu Pro 
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85 90 95 100 

Gly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Gly Asp Ala Pro Thr Ala He Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 

165 170 175 180 

Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 

245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Ser Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Gly Ala Ala Glu Gly Ala He Thr He Glu Leu Ala 

325 330 335 340 

Ser Tyr Pro He Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Gin Arg Phe Arg Cys Ser Phe Arg Gin Arg 

365 370 375 380 

Asp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys He Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 

405 410 415 420 

Pro Asn Thr Thr Arg Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr His Asn Glu 

445 450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly He Gly Arg Tyr Asn He Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 
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485 490 495 500 

Asp Thr Ser Leu He Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys He Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 

565 570 575 580 

He Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr He Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 * 620 

Ser Gly Arg Glu Leu Cys Tyr He Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe He Phe He Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 

645 650 655 660 

Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg He Ala Arg He 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe He Ser Pro Ala Ser Gin Val 

685 690 695 700 

Ala He Cys Leu Ala Leu He Leu Val Gin Leu Leu He Val Val Ala Trp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 

725 730 735 740 

Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Ala Tyr Asn Val Leu Leu He Ala 

745 750 755 760 

Leu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe He Gly Phe Thr Met Tyr Thr Thr Cys He He Trp Leu Ala Phe Leu Pro He Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Tlir Thr Thr Met Cys Val Ser Val Ser Leu 

805 810 815 820 

Ser Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His He He Leu Phe Gin 

825 830 835 840 

Pro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Ala 

845 850 855 860 

Arg Ala Ser Ser Ser Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 
Gly Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 
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Seq ID 6s 8688X1 lOSSD 

5 10 15 20 

Met Gly Ser Leu Leu Ala Leu Leu Ala Leu Leu Pro Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

His Gin Lys Gly Gly Pro Ala Glu Asp Cys Gly Pro Val Asn Glu His Arg Gly He Gin 

65 70 75 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Arg Asp Pro His Leu Leu Pro 

85 90 95 100 

Gly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Gly Asp Ala Pro Thr Ala He Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 

165 170 175 180 

Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 

245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Ser Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala S,er Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Gly Ala Ala Glu Gly Ala He Thr He Glu Leu Ala 

325 330 335 340 

Ser Tyr Pro He Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Gin Arg Phe Arg Cys Ser Phe Arg Gin Arg 

365 370 375 380 

Asp Cys Ala Ala His Ser Leu T^g Ala Val Pro Phe Glu Gin Glu Ser Lys He Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 
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405 410 415 420 

Pro Asn Thr Thr Arg Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr His Asn Glu 

445 450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly lie Gly Arg Tyr Asn He Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 

485 490 495 500 

Asp Thr Ser Leu He Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys He Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 

565 570 575 580 

He Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr He Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr He Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe He Phe He Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 

645 650 655 660 

Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg He Ala Arg He 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe He Ser Pro Ala Ser Gin Val 

685 690 695 700 

Ala He Cys Leu Ala Leu He Leu Gly Gin Leu Leu He Val Val Ala Trp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 

725 730 735 740 

Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Ala Tyr Asp Val Leu Leu He Ala 

745 750 755 760 

Leu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe He Gly Phe Thr Met Tyr Thr Thr Cys He He Trp Leu Ala Phe Leu Pro He Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 

805 810 815 820 
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Ser Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His lie He Leu Phe Gin 

825 830 835 840 

Pro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Ala 

845 850 855 860 

Arg Ala Ser Ser Ser Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 

Gly Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 



Se<3 ID 7 s G689V 

5 10 15 20 

Met Gly Ser Leu Leu Ala Leu Leu Ala Leu Leu Pro Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 . 55 60 

His Gin Lys Gly Gly Pro Ala Glu Asp Cys Gly Pro Val Asn Glu His Arg Gly He Gin 

65 70 75 • 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Arg Asp Pro His Leu Leu Pro 

85 90 95 100 

Gly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Gly Asp Ala Pro Thr Ala He Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 

165 170 175 180 

Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 

245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Ser Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Gly Ala Ala Glu Gly Ala He Thr He Glu Leu Ala 
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325 330 335 340 

Ser Tyr Pro lie Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Gin Arg Phe Arg Cys Ser Phe Arg Gin Arg 

365 370 375 380 

Asp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys lie Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 

405 410 415 420 

Pro Asn Thr Thr Arg Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr His Asn Glu 

445 450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly lie Gly Arg Tyr Asn lie Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 

485 490 495 500 

Asp Thr Ser Leu He Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys He Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 

565 570 575 580 

He Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr He Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr He Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe He Phe He Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 

645 650 655 660 

Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg He Ala Arg He 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe He Ser Pro Ala Ser Gin Val 

685 690 695 700 

Ala He Cys Leu Ala Leu He Ser Val Gin Leu Leu He Val Val Ala Trp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 

725 730 735 740 
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Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Ala Tyr Asn Val Leu Leu lie Ala 

745 750 755 760 

Leu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe He Gly Phe Thr Met Tyr Thr Thr Cys He He Trp Leu Ala Phe Leu Pro He Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 

805 810 815 820 

Ser Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His He He Leu Phe Gin 

825' 830 835 840 

Pro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Ala 

845 850 855 860 

Arg Ala Ser Ser Ser Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 
Gly Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 

Seq[ ZD 8: S688Zt, G689V, A733T, NTSSD 

5 10 15 . 20 

Met Gly Ser Leu Leu Ala Leu Leu Ala Leu Leu Pro Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

His Gin Lys Gly Gly Pro Ala Glu Asp Cys Gly Pro Val Asn Glu His Arg Gly He Gin 

65 70 .75 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Arg Asp Pro His Leu Leu Pro 

85 90 95 100 

Gly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 ' 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Gly Asp Ala Pro Thr Ala He Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 

165 170 175 180 

Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 
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245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 



^r^^' 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Ser Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Gly Ala Ala Glu Gly Ala lie Thr lie Glu Leu Ala 

325 330 335 340 

Ser Tyr Pro He Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Gin Arg Phe Arg Cys Ser Phe Arg Gin Arg 

365 370 375 380 

Asp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys He Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 

405 410 415 420 

Pro Asn Thr Thr Arg Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr His Asn Glu 

445 450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly He Gly Arg Tyr Asn He Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 

485 490 495 500 

Asp Thr Ser Leu He Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys He Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 

565 570 575 580 

He Arg T3:p Gly Asp Ala Trp Ala Val Gly Pro Val Thr He Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr He Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe He Phe He Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 

645 650 655 660 
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Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg lie Ala Arg lie 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe He Ser Pro Ala Ser Gin Val 

685 690 695 700 

Ala He Cys Leu Ala Leu He Leu Val Gin Leu Leu lie Val Val Ala Trp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 

725 730 735 740 

Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Thr Tyr Asp Val Leu Leu He Ala 

745 750 755 760 

Leu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe He Gly Phe Thr Met Tyr Thr Thr Cys He He Trp Leu Ala Phe Leu Pro He Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 

805 810 815 820 

Ser Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His He He Leu Phe Gin 

825 830 835 840 

Pro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Ala 

845 850 855 860 

Arg Ala Ser Ser Ser Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 
Gly Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 



wo 2004/024936 



PCTAJS2003/028470 



Seq XD 9: N735D 

1 atgggatcgc tgcttgcgct cctggcactg ctgccgctgt ggggtgctgt ggctgagggc 
61 ccagccaaga aggtgctgac cctggaggga gacttggtgc tgggtgggct gttcccagtg 
121 caccagaagg gcggcccagc agaggactgt ggtcctgtca atgagcaccg tggcatccag 
181 cgcctggagg ccatgctttt tgcactggac cgcatcaacc gtgacccgca cctgctgcct 
241 ggcgtgcgcc tgggtgcaca catcctcgac agttgctcca aggacacaca tgcgctggag 
301 caggcactgg actttgtgcg tgcctcactc agccgtggtg ctgatggctc acgccacatc 
361 tgccccgacg gctcttatgc gacccatggt gatgctccca ctgccatcac tggtgttatt 
421 ggcggttcct acagtgatgt ctccatccag gtggccaacc tcttgaggct atttcagatc 
481 ccacagatta gctacgcctc taccagtgcc aagctgagtg acaagtcccg ctatgactac 
541 tttgcccgca cagtgcctcc tgacttcttc caagccaagg ccatggctga gattctccgc 
601 ttcttcaact ggacctatgt gtccactgtg gcgtctgagg gcgactatgg cgagacaggc 
661 attgaggcct ttgagctaga ggctcgtgcc cgcaacatct gtgtggccac ctcggagaaa 
721 gtgggccgtg ccatgagccg cgcggccttt gagggtgtgg tgcgagccct gctgcagaag 
781 cccagtgccc gcgtggctgt cctgttcacc cgttctgagg atgcccggga gctgcttgct 
841 gccagccagc gcctcaatgc cagcttcacc tgggtggcca gtgatggttg gggggccctg 
901 gagagtgtgg tggcaggcag tgagggggct gctgagggtg ctatcaccat cgagcfcggcc 
961 tcctacccca tcagtgactt tgcctcctac ttccagagcc tggacccttg gaacaacagc 
1021 cggaacccct ggttccgtga attctgggag cagaggttcc gctgcagctt ccggcagcga 
1081 gactgcgcag cccactctct ccgggctgtg ccctttgagc aggagtccaa gatcatgttt 
1141 gtggtcaatg cagtgtacgc catggcccat gcgctccaca acatgcaccg tgccctctgc 
1201 cccaacacca cccggctctg tgacgcgatg cggccagtta acgggcgccg cctctacaag 
1261 gactttgtgc tcaacgtcaa gtttgatgcc ccctttcgcc cagctgacac ccacaatgag 
1321 gtccgctttg accgctttgg tgatggtatt ggccgctaca acatcttcac ctatctgcgt 
1381 gcaggcagtg ggcgctatcg ctaccagaag gtgggctact gggcagaagg cttgactctg 
1441 gacaccagcc tcatcccatg ggcctcaccc tcagccggcc ccctgcccgc ctctcgctgc 
1501 agtgagccct gcctccagaa tgaggtgaag agtgtgcagc cgggcgaagt ctgctgctgg 
1561 ctctgcattc cgtgccagcc ctatgagtac cgattggacg aattcacttg cgctgattgt 
1621 ggcctgggct actggcccaa tgccagcctg actggctgct tcgaactgcc ccaggagtac 
1681 atccgctggg gcgatgcctg ggctgtggga cctgtcacca tcgcctgcct cggtgccctg 
1741 gccaccctct ttgtgctggg tgtctttgtg cggcacaatg ccacaccagt ggtcaaggcc 
1801 tcaggtcggg agctctgcta catcctgctg ggtggtgtct tcctctgcta ctgcatgacc 
1861 ttcatcttca ttgccaagcc atccacggca gtgtgtacct tacggcgtct tggtttgggc 
1921 actgccttct ctgtctgcta ctcagccctg ctcaccaaga ccaaccgcat tgcacgcatc 
1981 ttcggtgggg cccgggaggg tgcccagcgg ccacgcttca tcagtcctgc ctcacaggtg 
2041 gccatctgcc tggcacttat ctcgggccag ctgctcatcg tggtcgcctg gctggtggtg 
2101 gaggcaccgg gcacaggcaa ggagacagcc cccgaacggc gggaggtggt gacactgcgc 
2161 tgcaaccacc gcgatgcaag tatgttgggc tcgctggcct acgatgtgct cctcatcgcg 
2221 ctctgcacgc tttatgcctt caagactcgc aagtgccccg aaaacttcaa cgaggccaag 
2281 ttcattggct tcaccatgta caccacctgc atcatctggc tggcattctt gcccatcttc 
2341 tatgtcacct ccagtgacta ccgggtacag accaccacca tgtgcgtgtc agtcagcctc 
2401 agcggctccg tggtgcttgg ctgcctcttt gcgcccaagc tgcacatcat cctcttccag 
2461 ccgcagaaga acgtggttag ccaccgggca cccaccagcc gctttggcag tgctgctgcc 
2521 agggccagct ccagccttgg ccaagggtct ggctcccagt ttgtccccac tgtttgcaat 
2581 ggccgtgagg tggtggactc gacaacgtca tcgctttga 
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Seq ZD 10 s 6689V N735D 

1 atgggatcgc tgcttgcgct cctggcactg 
61 ccagccaaga aggtgctgac cctggaggga 
121 caccagaagg gcggcccagc agaggactgt 
181 cgcctggagg ccatgctttt tgcactggac 
241 ggcgtgcgcc tgggtgcaca catcctcgac 
301 caggcactgg actttgtgcg tgcctcactc 
361 tgccccgacg gctcttatgc gacccatggt 
421 ggcggttcct acagtgatgt ctccatccag 
481 ccacagatta gctacgcctc taccagtgcc 
541 tttgcccgca cagtgcctcc tgacttcttc 
601 ttcttcaact ggacctatgt gtccactgtg 
661 attgaggcct ttgagctaga ggctcgtgcc 
721 gtgggccgtg ccatgagccg cgcggccttt 
781 cccagtgccc gcgtggctgt cctgttcacc 
841 gccagccagc gcctcaatgc cagcttcacc 
901 gagagtgtgg tggcaggcag tgagggggct 
961 tcctacccca tcagtgactt tgcctcctac 
1021 cggaacccct ggttccgtga attctgggag 
1081 gactgcgcag cccactctct ccgggctgtg 
1141 gtggtcaatg cagtgtacgc catggcccat 
1201 cccaacacca cccggctctg tgacgcgatg 
1261 gactttgtgc tcaacgtcaa gtttgatgcc 
1321 gtccgctttg accgctttgg tgatggtatt 
1381 gcaggcagtg ggcgctatcg ctaccagaag 
1441 gacaccagcc tcatcccatg ggcctcaccc 
1501 agtgagccct gcctccagaa tgaggtgaag 
1561 ctctgcattc cgtgccagcc ctatgagtac 
1621 ggcctgggct actggcccaa tgccagcctg 
1681 atccgctggg gcgatgcctg ggctgtggga 
1741 gccaccctct ttgtgctggg tgtctttgtg 
1801 tcaggtcggg agctctgcta catcctgctg 
1861 ttcatcttca ttgccaagcc atccacggca 
1921 actgccttct ctgtctgcta ctcagccctg 
1981 ttcggtgggg cccgggaggg tgcccagcgg 
2041 gccatctgcc tggcacttat ctcggtccag 
2101 gaggcaccgg gcacaggcaa ggagacagcc 
2161 tgcaaccacc gcgatgcaag tatgttgggc 
2221 ctctgcacgc tttatgcctt caagactcgc 
2281 ttcattggct tcaccatgta caccacctgc 
2341 tatgtcacct ccagtgacta ccgggtacag 
2401 agcggctccg tggtgcttgg ctgcctcttt 
2461 ccgcagaaga acgtggttag ccaccgggca 
2521 agggccagct ccagccttgg ccaagggtct 
2581 ggccgtgagg tggtggactc gacaacgtca 



ctgccgctgt ggggtgctgt ggctgagggc 
gacttggtgc tgggtgggct gttcccagtg 
ggtcctgtca atgagcaccg tggcatccag 
cgcatcaacc gtgacccgca cctgctgcct 
agttgctcca aggacacaca tgcgctggag 
agccgtggtg ctgatggctc acgccacatc 
gatgctccca ctgccatcac tggtgttatt 
gtggccaacc tcttgaggct atttcagatc 
aagctgagtg acaagtcccg ctatgactac 
caagccaagg ccatggctga gattctccgc 
gcgtctgagg gcgactatgg cgagacaggc 
cgcaacatct gtgtggccac ctcggagaaa 
gagggtgtgg tgcgagccct gctgcagaag 
cgttctgagg atgcccggga gctgcttgct 
tgggtggcca gtgatggttg gggggccctg 
gctgagggtg ctatcaccat cgagctggcc 
ttccagagcc tggacccttg gaacaacagc 
cagaggttcc gctgcagctt ccggcagcga 
ccctttgagc aggagtccaa gatcatgttt 
gcgctccaca acatgcaccg tgccctctgc 
cggccagtta acgggcgccg cctctacaag 
ccctttcgcc cagctgacac ccacaatgag 
ggccgctaca acatcttcac ctatctgcgt 
gtgggctact gggcagaagg cttgactctg 
tcagccggcc ccctgcccgc ctctcgctgc 
agtgtgcagc cgggcgaagt ctgctgctgg 
cgattggacg aattcacttg cgctgattgt 
actggctgct tcgaactgcc ccaggagtac 
cctgtcacca tcgcctgcct cggtgccctg 
cggcacaatg ccacaccagt ggtcaaggcc 
ggtggtgtct tcctctgcta ctgcatgacc 
gtgtgtacct tacggcgtct tggtttgggc 
ctcaccaaga ccaaccgcat tgcacgcatc 
ccacgcttca tcagtcctgc ctcacaggtg 
ctgctcatcg tggtcgcctg gctggtggtg 
cccgaacggc gggaggtggt gacactgcgc 
tcgctggcct acgatgtgct cctcatcgcg 
aagtgccccg aaaacttcaa cgaggccaag 
atcatctggc tggcattctt gcccatcttc 
accaccacca tgtgcgtgtc agtcagcctc 
gcgcccaagc tgcacatcat cctcttccag 
cccaccagcc gctttggcag tgctgctgcc 
ggctcccagt ttgtccccac tgtttgcaat 
tcgctttga 
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Seq ZD lis S688I., 6689V, xn35D 

1 atgggatcgc tgcttgcgct cctggcactg ctgccgctgt ggggtgctgt ggctgagggc 
61 ccagccaaga aggtgctgac cctggaggga gacttggtgc tgggtgggct gttcccagtg 
121 caccagaagg gcggcccagc agaggactgt ggtcctgtca atgagcaccg tggcatccag 
181 cgcctggagg ccatgctttt tgcactggac cgcatcaacc gtgacccgca cctgctgcct 
241 ggcgtgcgcc tgggtgcaca catcctcgac agttgctcca aggacacaca tgcgctggag 
301 caggcactgg actttgtgcg tgcctcactc agccgtggtg ctgatggctc acgccacatc 
361 tgccccgacg gctcttatgc gacccatggt gatgctccca ctgccatcac tggtgttatt 
421 ggcggttcct acagtgatgt ctccatccag gtggccaacc tcttgaggct atttcagatc 
481 ccacagatta gctacgcctc taccagtgcc aagctgagtg acaagtcccg ctatgactac 
541 tttgcccgca cagtgcctcc tgacttcttc caagccaagg ccatggctga gattctccgc 
601 ttcttcaact ggacctatgt gtccactgtg gcgtctgagg gcgactatgg cgagacaggc 
661 attgaggcct ttgagctaga ggctcgtgcc cgcaacatct gtgtggccac ctcggagaaa 
721 gtgggccgtg ccatgagccg cgcggccttt gagggtgtgg tgcgagccct gctgcagaag 
781 cccagtgccc gcgtggctgt cctgttcacc cgttctgagg atgcccggga gctgcttgct 
841 gccagccagc gcctcaatgc cagcttcacc tgggtggcca gtgatggttg gggggccctg 
901 gagagtgtgg tggcaggcag tgagggggct gctgagggtg ctatcaccat cgagctggcc 
961 tcctacccca tcagtgactt tgcctcctac ttccagagcc tggacccttg gaacaacagc 
1021 cggaacccct ggttccgtga attctgggag cagaggttcc gctgcagctt ccggcagcga 
1081 gactgcgcag cccactctct ccgggctgtg ccctttgagc aggagtccaa gatcatgttt 
1141 gtggtcaatg cagtgtacgc catggcccat gcgctccaca acatgcaccg tgccctctgc 
1201 cccaacacca cccggctctg tgacgcgatg cggccagtta acgggcgccg cctctacaag 
1261 gactttgtgc tcaacgtcaa gtttgatgcc ccctttcgcc cagctgacac ccacaatgag 
1321 gtccgctttg accgctttgg tgatggtatt ggccgctaca acatcttcac ctatctgcgt 
1381 gcaggcagtg ggcgctatcg ctaccagaag gtgggctact gggcagaagg cttgactctg 
1441 gacaccagcc tcatcccatg ggcctcaccc tcagccggcc ccctgcccgc ctctcgctgc 
1501 agtgagccct gcctccagaa tgaggtgaag agtgtgcagc cgggcgaagt ctgctgctgg 
1561 ctctgcattc cgtgccagcc ctatgagtac cgattggacg aattcacttg cgctgattgt 
1621 ggcctgggct actggcccaa tgccagcctg actggctgct tcgaactgcc ccaggagtac 
1681 atccgctggg gcgatgcctg ggctgtggga cctgtcacca tcgcctgcct cggtgccctg 
1741 gccaccctct ttgtgctggg tgtctttgtg cggcacaatg ccacaccagt ggtcaaggcc 
1801 tcaggtcggg agctctgcta catcctgctg ggtggtgtct tcctctgcta ctgcatgacc 
1861 ttcatcttca ttgccaagcc atccacggca gtgtgtacct tacggcgtct tggtttgggc 
1921 actgccttct ctgtctgcta ctcagccctg ctcaccaaga ccaaccgcat tgcacgcatc 
1981 ttcggtgggg cccgggaggg tgcccagcgg ccacgcttca tcagtcctgc ctcacaggtg 
2041 gccatctgcc tggcacttat cttggtccag ctgctcatcg tggtcgcctg gctggtggtg 
2101 gaggcaccgg gcacaggcaa ggagacagcc cccgaacggc gggaggtggt gacactgcgc 
2161 tgcaaccacc gcgatgcaag tatgttgggc tcgctggcct acgatgtgct cctcatcgcg 
2221 ctctgcacgc tttatgcctt caagactcgc aagtgccccg aaaacttcaa cgaggccaag 
2281 ttcattggct tcaccatgta caccacctgc atcatctggc tggcattctt gcccatcttc 
2341 tatgtcacct ccagtgacta ccgggtacag accaccacca tgtgcgtgtc agtcagcctc 
2401 agcggctccg tggtgcttgg ctgcctcttt gcgcccaagc tgcacatcat cctcttccag 
2461 ccgcagaaga acgtggttag ccaccgggca cccaccagcc gctttggcag tgctgctgcc 
2521 agggccagct ccagccttgg ccaagggtct ggctcccagt ttgtccccac tgtttgcaat 
2581 ggccgtgagg tggtggactc gacaacgtca tcgctttga 
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Seq ZD 12 s S688Z. 

1 atgggatcgc tgcttgcgct cctggcactg ctgccgctgt ggggtgctgt ggctgagggc 
61 ccagccaaga aggtgctgac cctggaggga gacttggtgc tgggtgggct gttcccagtg 
121 caccagaagg gcggcccagc agaggactgt ggtcctgtca atgagcaccg tggcatccag 
181 cgcctggagg ccatgctttt tgcactggac cgcatcaacc gtgacccgca cctgctgcct 
241 ggcgtgcgcc tgggtgcaca catcctcgac agttgctcca aggacacaca tgcgctggag 
301 caggcactgg actttgtgcg tgcctcactc agccgtggtg ctgatggctc acgccacatc 
361 tgccccgacg gctcttatgc gacccatggt gatgctccca ctgccatcac tggtgttatt 
421 ggcggttcct acagtgatgt ctccatccag gtggccaacc tcttgaggct atttcagatc 
481 ccacagatta gctacgcctc taccagtgcc aagctgagtg acaagtcccg ctatgactac 
541 tttgcccgca cagtgcctcc tgacttcttc caagccaagg ccatggctga gattctccgc 
601 ttcttcaact ggacctatgt gtccactgtg gcgtctgagg gcgactatgg cgagacaggc 
661 attgaggcct ttgagctaga ggctcgtgcc cgcaacatct gtgtggccac ctcggagaaa 
721 gtgggccgtg ccatgagccg cgcggccttt gagggtgtgg tgcgagccct gctgcagaag 
781 cccagtgccc gcgtggctgt cctgttcacc cgttctgagg atgcccggga gctgcttgct 
841 gccagccagc gcctcaatgc cagcttcacc tgggtggcca gtgatggttg gggggccctg 
901 gagagtgtgg tggcaggcag tgagggggct gctgagggtg ctatcaccat cgagctggcc 
961 tcctacccca tcagtgactt tgcctcctac ttccagagcc tggacccttg gaacaacagc 
1021 cggaacccct ggttccgtga attctgggag cagaggttcc gctgcagctt ccggcagcga 
1081 gactgcgcag cccactctct ccgggctgtg ccctttgagc aggagtccaa gatcatgttt 
1141 gtggtcaatg cagtgtacgc catggcccat gcgctccaca acatgcaccg tgccctctgc 
1201 cccaacacca cccggctctg tgacgcgatg cggccagtta acgggcgccg cctctacaag 
1261 gactttgtgc tcaacgtcaa gtttgatgcc ccctttcgcc ' cagctgacac ccacaatgag 
1321 gtccgctttg accgctttgg tgatggtatt ggccgctaca acatcttcac ctatctgcgt 
1381 gcaggcagtg ggcgctatcg ctaccagaag gtgggctact gggcagaagg cttgactctg 
1441 gacaccagcc tcatcccatg ggcctcaccc tcagccggcc ccctgcccgc ctctcgctgc 
1501 agtgagccct gcctccagaa tgaggtgaag agtgtgcagc cgggcgaagt ctgctgctgg 
1561 ctctgcattc cgtgccagcc ctatgagtac cgattggacg aattcacttg cgctgattgt 
1621 ggcctgggct actggcccaa tgccagcctg actggctgct tcgaactgcc ccaggagtac 
1681 atccgctggg gcgatgcctg ggctgtggga cctgtcacca tcgcctgcct cggtgccctg 
1741 gccaccctct ttgtgctggg tgtctttgtg cggcacaatg ccacaccagt ggtcaaggcc 
1801 tcaggtcggg agctctgcta catcctgctg ggtggtgtct tcctctgcta ctgcatgacc 
1861 ttcatcttca ttgccaagcc atccacggca gtgtgtacct tacggcgtct tggtttgggc 
1921 actgccttct ctgtctgcta ctcagccctg ctcaccaaga ccaaccgcat tgcacgcatc 
1981 ttcggtgggg cccgggaggg tgcccagcgg ccacgcttca tcagtcctgc ctcacaggtg 
2041 gccatctgcc tggcacttat cttgggccag ctgctcatcg tggtcgcctg gctggtggtg 
2101 gaggcaccgg gcacaggcaa ggagacagcc cccgaacggc gggaggtggt gacactgcgc 
2161 tgcaaccacc gcgatgcaag tatgttgggc tcgctggcct acaatgtgct cctcatcgcg 
2221 ctctgcacgc tttatgcctt caagactcgc aagtgccccg aaaacttcaa cgaggccaag 
2281 ttcattggct tcaccatgta caccacctgc atcatctggc tggcattctt gcccatcttc 
2341 tatgtcacct ccagtgacta ccgggtacag accaccacca tgtgcgtgtc agtcagcctc 
2401 agcggctccg tggtgcttgg ctgcctcttt gcgcccaagc tgcacatcat cctcttccag 
2461 ccgcagaaga acgtggttag ccaccgggca cccaccagcc gctttggcag tgctgctgcc 
2521 agggccagct ccagccttgg ccaagggtct ggctcccagt ttgtccccac tgtttgcaat 
2581 ggccgtgagg tggtggactc gacaacgtca tcgctttga 
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Seq XD 13: S688I., G689V 

1 atgggatcgc tgcttgcgct cctggcactg 
61 ccagccaaga aggtgctgac cctggaggga 
121 caccagaagg gcggcccagc agaggactgt 
181 cgcctggagg ccatgctttt tgcactggac 
241 ggcgtgcgcc tgggtgcaca catcctcgac 
301 caggcactgg actttgtgcg tgcctcactc 
361 tgccccgacg gctcttatgc gacccatggt 
421 ggcggttcct acagtgatgt ctccatccag 
481 ccacagatta gctacgcctc taccagtgcc 
541 tttgcccgca cagtgcctcc tgacttcttc 
601 ttcttcaact ggacctatgt gtccactgtg 
661 attgaggcct ttgagctaga ggctcgtgcc 
721 gtgggccgtg ccatgagccg cgcggccttt 
781 cccagtgccc gcgtggctgt cctgttcacc 
841 gccagccagc gcctcaatgc cagcttcacc 
901 gagagtgtgg tggcaggcag tgagggggct 
961 tcctacccca tcagtgactt tgcctcctac 
1021 cggaacccct ggttccgtga attctgggag 
1081 gactgcgcag cccactctct ccgggctgtg 
1141 gtggtcaatg cagtgtacgc catggcccat 
1201 cccaacacca cccggctctg tgacgcgatg 
1261 gactttgtgc tcaacgtcaa gtttgatgcc 
1321 gtccgctttg accgctttgg tgatggtatt 
1381 gcaggcagtg ggcgctatcg ctaccagaag 
1441 gacaccagcc tcatcccatg ggcctcaccc 
1501 agtgagccct gcctccagaa tgaggtgaag 
1561 ctctgcattc cgtgccagcc ctatgagtac 
1621 ggcctgggct actggcccaa tgccagcctg 
1681 atccgctggg gcgatgcctg ggctgtggga 
1741 gccaccctct ttgtgctggg tgtctttgtg 
1801 tcaggtcggg agctctgcta catcctgctg 
1861 ttcatcttca ttgccaagcc atccacggca 
1921 actgccttct ctgtctgcta ctcagccctg 
1981 ttcggtgggg cccgggaggg tgcccagcgg 
2041 gccatctgcc tggcacttat cttggtccag 
2101 gaggcaccgg gcacaggcaa ggagacagcc 
2161 tgcaaccacc gcgatgcaag tatgttgggc 
2221 ctctgcacgc tttatgcctt caagactcgc 
2281 ttcattggct tcaccatgta caccacctgc 
2341 tatgtcacct ccagtgacta ccgggtacag 
2401 agcggctccg tggtgcttgg ctgcctcttt 
2461 ccgcagaaga acgtggttag ccaccgggca 
2521 agggccagct ccagccttgg ccaagggtct 
2581 ggccgtgagg tggtggactc gacaacgtca 



ctgccgctgt ggggtgctgt ggctgagggc 
gacttggtgc tgggtgggct gttcccagtg 
ggtcctgtca atgagcaccg tggcatccag 
cgcatcaacc gtgacccgca cctgctgcct 
agttgctcca aggacacaca tgcgctggag 
agccgtggtg ctgatggctc acgccacatc 
gatgctccca ctgccatcac tggtgttatt 
gtggccaacc tcttgaggct atttcagatc 
aagctgagtg acaagtcccg ctatgactac 
caagccaagg ccatggctga gattctccgc 
gcgtctgagg gcgactatgg cgagacaggc 
cgcaacatct gtgtggccac ctcggagaaa 
gagggtgtgg tgcgagccct gctgcagaag 
cgttctgagg atgcccggga gctgcttgct 
tgggtggcca gtgatggttg gggggccctg 
gctgagggtg ctatcaccat cgagctggcc 
ttccagagcc tggacccttg gaacaacagc 
cagaggttcc gctgcagctt ccggcagcga 
ccctttgagc aggagtccaa gatcatgttt 
gcgctccaca acatgcaccg tgccctctgc 
cggccagtta acgggcgccg cctctacaag 
ccctttcgcc cagctgacac ccacaatgag 
ggccgctaca acatcttcac ctatctgcgt 
gtgggctact gggcagaagg cttgactctg 
tcagccggcc ccctgcccgc ctctcgctgc 
agtgtgcagc cgggcgaagt ctgctgctgg 
cgattggacg aattcacttg cgctgattgt 
actggctgct tcgaactgcc ccaggagtac 
cctgtcacca tcgcctgcct cggtgccctg 
cggcacaatg ccacaccagt ggtcaaggcc 
ggtggtgtct tcctctgcta ctgcatgacc 
gtgtgtacct tacggcgtct tggtttgggc 
ctcaccaaga ccaaccgcat tgcacgcatc 
ccacgcttca tcagtcctgc ctcacaggtg 
ctgctcatcg tggtcgcctg gctggtggtg 
cccgaacggc gggaggtggt gacactgcgc 
tcgctggcct acaatgtgct cctcatcgcg 
aagtgccccg aaaacttcaa cgaggccaag 
atcatctggc tggcattctt gcccatcttc 
accaccacca tgtgcgtgtc agtcagcctc 
gcgcccaagc tgcacatcat cctcttccag 
cccaccagcc gctttggcag tgctgctgcc 
ggctcccagt ttgtccccac tgtttgcaat 
tcgctttga 
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Seq ID 14: S688Zi, ir735D 

1 atgggatcgc tgcttgcgct cctggcactg 
61 ccagccaaga aggtgctgac cctggaggga 
121 caccagaagg gcggcccagc agaggactgt 
181 cgcctggagg ccatgctttt tgcactggac 
241 ggcgtgcgcc tgggtgcaca catcctcgac 
301 caggcactgg actttgtgcg tgcctcactc 
361 tgccccgacg gctcttatgc gacccatggt 
421 ggcggttcct acagtgatgt ctccatccag 
481 ccacagatta gctacgcctc taccagtgcc 
541 tttgcccgca cagtgcctcc tgacttcttc 
601 ttcttcaact ggacctatgt gtccactgtg 
661 attgaggcct ttgagctaga ggctcgtgcc 
721 gtgggccgtg ccatgagccg cgcggccttt 
781 cccagtgccc gcgtggctgt cctgttcacc 
841 gccagccagc gcctcaatgc cagcttcacc 
901 gagagtgtgg tggcaggcag tgagggggct 
961 tcctacccca tcagtgactt tgcctcctac 
1021 cggaacccct ggttccgtga attctgggag 
1081 gactgcgcag cccactctct ccgggctgtg 
1141 gtggtcaatg cagtgtacgc catggcccat 
12 01 cccaacacca cccggctctg tgacgcgatg 
1261 gactttgtgc tcaacgtcaa gtttgatgcc 
1321 gtccgctttg accgctttgg tgatggtatt 
1381 gcaggcagtg ggcgctatcg ctaccagaag 
1441 gacaccagcc tcatcccatg ggcctcaccc 
1501 agtgagccct gcctccagaa tgaggtgaag 
1561 ctctgcattc cgtgccagcc ctatgagtac 
1621 ggcctgggct actggcccaa tgccagcctg 
1681 atccgctggg gcgatgcctg ggctgtggga 
1741 gccaccctct ttgtgctggg tgtctttgtg 
1801 tcaggtcggg agctctgcta catcctgctg 
1861 ttcatcttca ttgccaagcc atccacggca 
1921 actgccttct ctgtctgcta ctcagccctg 
1981 ttcggtgggg cccgggaggg tgcccagcgg 
2041 gccatctgcc tggcacttat cttgggccag 
2101 gaggcaccgg gcacaggcaa ggagacagcc 
2161 tgcaaccacc gcgatgcaag tatgttgggc 
2221 ctctgcacgc tttatgcctt caagactcgc 
2281 ttcattggct tcaccatgta caccacctgc 
2341 tatgtcacct ccagtgacta ccgggtacag 
2401 agcggctccg tggtgcttgg ctgcctcttt 
2461 ccgcagaaga acgtggttag ccaccgggca 
2521 agggccagct ccagccttgg ccaagggtct 
2581 ggccgtgagg tggtggactc gacaacgtca 



ctgccgctgt ggggtgctgt ggctgagggc 
gacttggtgc tgggtgggct gttcccagtg 
ggtcctgtca atgagcaccg tggcatccag 
cgcatcaacc gtgacccgca cctgctgcct 
agttgctcca aggacacaca tgcgctggag 
agccgtggtg ctgatggctc acgccacatc 
gatgctccca ctgccatcac tggtgttatt 
gtggccaacc tcttgaggct atttcagatc 
aagctgagtg acaagtcccg ctatgactac 
caagccaagg ccatggctga gattctccgc 
gcgtctgagg gcgactatgg cgagacaggc 
cgcaacatct gtgtggccac ctcggagaaa 
gagggtgtgg tgcgagccct gctgcagaag 
cgttctgagg atgcccggga gctgcttgct 
tgggtggcca gtgatggttg gggggccctg 
gctgagggtg ctatcaccat cgagctggcc 
ttccagagcc tggacccttg gaacaacagc 
cagaggttcc gctgcagctt ccggcagcga 
ccctttgagc aggagtccaa gatcatgttt 
gcgctccaca acatgcaccg tgccctctgc 
cggccagtta acgggcgccg cctctacaag 
ccctttcgcc cagctgacac ccacaatgag 
ggccgctaca acatcttcac ctatctgcgt 
gtgggctact gggcagaagg cttgactctg 
tcagccggcc ccctgcccgc ctctcgctgc 
agtgtgcagc cgggcgaagt ctgctgctgg 
cgattggacg aattcacttg cgctgattgt 
actggctgct tcgaactgcc ccaggagtac 
cctgtcacca tcgcctgcct cggtgccctg 
cggcacaatg ccacaccagt ggtcaaggcc 
ggtggtgtct tcctctgcta ctgcatgacc 
gtgtgtacct tacggcgtct tggtttgggc 
ctcaccaaga ccaaccgcat tgcacgcatc 
ccacgcttca tcagtcctgc ctcacaggtg 
ctgctcatcg tggtcgcctg gctggtggtg 
cccgaacggc gggaggtggt gacactgcgc 
tcgctggcct acgatgtgct cctcatcgcg 
aagtgccccg aaaacttcaa cgaggccaag 
atcatctggc tggcattctt gcccatcttc 
accaccacca tgtgcgtgtc agtcagcctc 
gcgcccaagc tgcacatcat cctcttccag 
cccaccagcc gctttggcag tgctgctgcc 
ggctcccagt ttgtccccac tgtttgcaat 
tcgctttga 
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Seq ID 15s 6689V 

1 atgggatcgc tgcttgcgct cctggcactg 
61 ccagccaaga aggtgctgac cctggaggga 
121 caccagaagg gcggcccagc agaggactgt 
181 cgcctggagg ccatgctttt tgcactggac 
241 ggcgtgcgcG tgggtgcaca catcctcgac 
301 caggcactgg actttgtgcg tgcctcactc 
361 tgccccgacg gctcttatgc gacccatggt 
421 ggcggttcct acagtgatgt ctccatccag 
481 ccacagatta gctacgcctc taccagtgcc 
541 tttgcccgca cagtgcctcc tgacttcttc 
601 ttcttcaact ggacctatgt gtccactgtg 
661 attgaggcct ttgagctaga ggctcgtgcc 
721 gtgggccgtg ccatgagccg cgcggccttt 
781 cccagtgccc gcgtggctgt cctgttcacc 
841 gccagccagc gcctcaatgc cagcttcacc 
901 gagagtgtgg tggcaggcag tgagggggct 
961 tcctacccca tcagtgactt tgcctcctac 
1021 cggaacccct ggttccgtga attctgggag 
1081 gactgcgcag cccactctct ccgggctgtg 
1141 gtggtcaatg cagtgtacgc catggcccat 
1201 cccaacacca cccggctctg tgacgcgatg 
1261 gactttgtgc tcaacgtcaa gtttgatgcc 
1321 gtccgctttg accgctttgg tgatggtatt 
1381 gcaggcagtg ggcgctatcg ctaccagaag 
1441 gacaccagcc tcatcccatg ggcctcaccc 
1501 agtgagccct gcctccagaa tgaggtgaag 
1561 ctctgcattc cgtgccagcc ctatgagtac 
1621 ggcctgggct actggcccaa tgccagcctg 
1681 atccgctggg gcgatgcctg ggctgtggga 
1741 gccaccctct ttgtgctggg tgtctttgtg 
1801 tcaggtcggg agctctgcta catcctgctg 
1861 ttcatcttca ttgccaagcc atccacggca 
1921 actgccttct ctgtctgcta ctcagccctg 
1981 ttcggtgggg cccgggaggg tgcccagcgg 
2041 gccatctgcc tggcacttat ctcggtccag 
2101 gaggcaccgg gcacaggcaa ggagacagcc 
2161 tgcaaccacc gcgatgcaag tatgfctgggc 
2221 ctctgcacgc tttatgcctt caagactcgc 
2281 ttcattggct tcaccatgta caccacctgc 
2341 tatgtcacct ccagtgacta ccgggtacag 
2401 agcggctccg tggtgcttgg ctgcctcttt 
2461 ccgcagaaga acgtggttag ccaccgggca 
2521 agggccagct ccagccttgg ccaagggtct 
2581 ggccgtgagg tggtggactc gacaacgtca 



ctgccgctgt ggggtgctgt ggctgagggc 
gacttggtgc tgggtgggct gttcccagtg 
ggtcctgtca atgagcaccg tggcatccag 
cgcatcaacc gtgacccgca cctgctgcct 
agttgctcca aggacacaca tgcgctggag 
agccgtggtg ctgatggctc acgccacatc 
gatgctccca ctgccatcac tggtgttatt 
gtggccaacc tcttgaggct atttcagatc 
aagctgagtg acaagtcccg ctatgactac 
caagccaagg ccatggctga gattctccgc 
gcgtctgagg gcgactatgg cgagacaggc 
cgcaacatct gtgtggccac ctcggagaaa 
gagggtgtgg tgcgagccct gctgcagaag 
cgttctgagg atgcccggga gctgcttgct 
tgggtggcca gtgatggttg gggggccctg 
gctgagggtg ctatcaccat cgagctggcc 
ttccagagcc tggacccttg gaacaacagc 
cagaggttcc gctgcagctt ccggcagcga 
ccctttgagc aggagtccaa gatcatgttt 
gcgctccaca acatgcaccg tgccctctgc 
cggccagtta acgggcgccg cctctacaag 
ccctttcgcG cagctgacac ccacaatgag 
ggccgctaca acatcttcac ctatctgcgt 
gtgggctact gggcagaagg cttgactctg 
tcagccggcc ccctgcccgc ctctcgctgc 
agtgtgcagc cgggcgaagt ctgctgctgg 
cgattggacg aattcacttg cgctgattgt 
actggctgct tcgaactgcc ccaggagtac 
cctgtcacca tcgcctgcct cggtgccctg 
cggcacaatg ccacaccagt ggtcaaggcc 
ggtggtgtct tcctctgcta ctgcatgacc 
gtgtgtacct tacggcgtct tggtttgggc 
ctcaccaaga ccaaccgcat tgcacgcatc 
ccacgcttca tcagtcctgc ctcacaggtg 
ctgctcatcg tggtcgcctg gctggtggtg 
cccgaacggc gggaggtggt gacactgcgc 
tcgctggcct acaatgtgct cctcatcgcg 
aagtgccccg aaaacttcaa cgaggccaag 
atcatctggc tggcattctt gcccatcttc 
accaccacca tgtgcgtgtc agtcagcctc 
gcgcccaagc tgcacatcat cctcttccag 
cccaccagcc gctttggcag tgctgctgcc 
ggctcccagt ttgtccccac tgtttgcaat 
tcgctttga 
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Se<2 XD 16: S688I., G689V, A733T, sn35D 

1 atgggatcgc tgcttgcgct cctggcactg 
61 ccagccaaga aggtgctgac cctggaggga 
121 caccagaagg gcggcccagc agaggactgt 
181 cgcctggagg ccatgctttt tgcactggac 
241 ggcgtgcgcc tgggtgcaca catcctcgac 
301 caggcactgg actttgtgcg tgcctcactc 
3 61 tgccccgacg gctcttatgc gacccatggt 
421 ggcggttcct acagtgatgt ctccatccag 
481 ccacagatta gctacgcctc taccagtgcc 
541 tttgcccgca cagtgcctcc tgacttcttc 
601 ttcttcaact ggacctatgt gtccactgtg 
661 attgaggcct ttgagctaga ggctcgtgcc 
721 gtgggccgtg ccatgagccg cgcggccttt 
781 cccagtgccc gcgtggctgt cctgttcacc 
841 gccagccagc gcctcaatgc cagcttcacc 
901 gagagtgtgg tggcaggcag tgagggggct 
9 61 tcctacccca tcagtgactt tgcctcctac 
1021 cggaacccct ggttccgtga attctgggag 
1081 gactgcgcag cccactctct ccgggctgtg 
1141 gtggtcaatg cagtgtacgc catggcccat 
1201 cccaacacca cccggctctg tgacgcgatg 
1261 -gactttgtgc tcaacgtcaa gtttgatgcc 
1321 gtccgctttg accgctttgg tgatggtatt 
1381 gcaggcagtg ggcgctatcg ctaccagaag 
1441 gacaccagcc tcatcccatg ggcctcaccc 
1501 agtgagccct gcctccagaa tgaggtgaag 
1561 ctctgcattc cgtgccagcc ctatgagtac 
1621 ggcctgggct actggcccaa tgccagcctg 
1681 atccgctggg gcgatgcctg ggctgtggga 
1741 gccaccctct ttgtgctggg tgtctttgtg 
1801 tcaggtcggg agctctgcta catcctgctg 
1861 ttcatcttca ttgccaagcc atccacggca 
1921 actgccttct ctgtctgcta ctcagccctg 
1981 ttcggtgggg cccgggaggg tgcccagcgg 
2041 gccatctgcc tggcacttat cttggtccag 
2101 gaggcaccgg gcacaggcaa ggagacagcc 
2161 tgcaaccacc gcgatgcaag tatgttgggc 
2221 ctctgcacgc tttatgcctt caagactcgc 
2281 ttcattggct tcaccatgta caccacctgc 
2341 tatgtcacct ccagtgacta ccgggtacag 
2401 agcggctccg tggtgcttgg ctgcctcttt 
2461 ccgcagaaga acgtggttag ccaccgggca 
2521 agggccagct ccagccttgg ccaagggtct 
2581 ggccgtgagg tggtggactc gacaacgtca 



ctgccgctgt ggggtgctgt ggctgagggc 
gacttggtgc tgggtgggct gttcccagtg 
ggtcctgtca atgagcaccg tggcatccag 
cgcatcaacc gtgacccgca cctgctgcct 
agttgctcca aggacacaca tgcgctggag 
agccgtggtg ctgatggctc acgccacatc 
gatgctccca ctgccatcac tggtgttatt 
gtggccaacc tcttgaggct atttcagatc 
aagctgagtg acaagtcccg ctatgactac 
caagccaagg ccatggctga gattctccgc 
gcgtctgagg gcgactatgg cgagacaggc 
cgcaacatct gtgtggccac ctcggagaaa 
gagggtgtgg tgcgagccct gctgcagaag 
cgttctgagg atgcccggga gctgcttgct 
tgggtggcca gtgatggttg gggggccctg 
gctgagggtg ctatcaccat cgagctggcc 
ttccagagcc tggacccttg gaacaacagc 
cagaggttcc gctgcagctt ccggcagcga 
ccctttgagc aggagtccaa gatcatgttt 
gcgctccaca acatgcaccg tgccctctgc 
cggccagtta acgggcgccg cctctacaag 
ccctttcgcc cagctgacac ccacaatgag 
ggccgctaca acatcttcac ctatctgcgt 
gtgggctact gggcagaagg cttgactctg 
tcagccggcc ccctgcccgc ctctcgctgc 
agtgtgcagc cgggcgaagt ctgctgctgg 
cgattggacg aattcacttg cgctgattgt 
actggctgct tcgaactgcc ccaggagtac 
cctgtcacca tcgcctgcct cggtgccctg 
cggcacaatg ccacaccagt ggtcaaggcc 
ggtggtgtct tcctctgcta ctgcatgacc 
gtgtgtacct tacggcgtct tggtttgggc 
ctcaccaaga ccaaccgcat tgcacgcatc 
ccacgcttca tcagtcctgc ctcacaggtg 
ctgctcatcg tggtcgcctg gctggtggtg 
cccgaacggc gggaggtggt gacactgcgc 
tcgctgacct acgatgtgct cctcatcgcg 
aagtgccccg aaaacttcaa cgaggccaag 
atcatctggc tggcattctt gcccatcttc 
accaccacca tgtgcgtgtc agtcagcctc 
gcgcccaagc tgcacatcat cctcttccag 
cccaccagcc gctttggcag tgctgctgcc 
ggctcccagt ttgtccccac tgtttgcaat 
tcgctttga 
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Rat Sequences 
Seq ZD 17: N735D 

5 10 15 20 

Met Glu Ser Leu Leu Gly Phe Leu Ala Leu Leu Leu Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

His Gin Lys Gly Gly Pro Ala Glu Glu Cys Gly Pro Val Asn Glu His Arg Gly He Gin 

65 70 75 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Arg Asp Pro His Leu Leu Pro 

85 90 95 100 

Gly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Ser Asp Ala Pro Thr Ala Val Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 

165 170 175 180 

Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 

245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Thr Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Arg Ala Ala Glu Gly Ala He Thr He Glu Leu Ala 

325 330 335 340 

Ser Tyr Pro He Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Glu Arg Phe His Cys Ser Phe Arg Gin Arg 

365 370 375 380 

Asp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys He Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 
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405 410 415 420 

Pro Asn Thr Thr His Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr Asp Asp Glu 

445 450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly lie Gly Arg Tyr Asn lie Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 

485 490 495 500 

Asp Thr Ser Phe lie Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys lie Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 

565 570 575 580 

lie Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr He Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr He Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe Val Phe He Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 

645 650 655 660 

Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg He Ala Arg He 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe He Ser Pro Ala Ser Gin Val 

685 690 695 700 

Ala He Cys Leu Ala Leu He Ser Gly Gin Leu Leu He Val Ala Ala Trp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 

725 730 735 740 

Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Ala Tyr Asp Val Leu Leu He Ala 

745 750 755 760 

Leu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe He Gly Phe Thr Met Tyr Thr Thr Cys He He Trp Leu Ala Phe Leu Pro He Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 
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805 810 815 820 

>er Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His lie lie Leu Phe Gin 

825 830 835 840 

Pro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Pro 

845 850 855 860 

^rg Ala Ser Ala Asn Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 
31y Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 



5eq ID 18: 6689V, N735D 

5 10-15 20 

VLet Glu Ser Leu Leu Gly Phe Leu Ala Leu Leu Leu Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

His Gin Lys Gly Gly Pro Ala Glu Glu Cys Gly Pro Val Asn Glu His Arg Gly lie Gin 

65 70 75 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg lie Asn Arg Asp Pro His Leu Leu Pro 

85 90 95 100 

Gly Val Arg Leu Gly Ala His lie Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Ser Asp Ala Pro Thr Ala Val Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 

165 170 175 180 

Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 

245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Thr Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Tarp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Arg Ala Ala Glu Gly Ala He Thr He Glu Leu Ala 
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325 330 335 340 

Ser Tyx Pro lie Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Glu Arg Phe His Cys Ser Phe Arg Gin Arg 

365 370 375 380 

Asp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys lie Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 

405 410 415 420 

Pro Asn Thr Thr His Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr Asp Asp Glu 

445 450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly lie Gly Arg Tyr Asn lie Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 

485 490 495 500 

Asp Thr Ser Phe lie Pro Tan? Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys lie Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 

565 570 575 580 

He Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr He Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr He Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe Val Phe He Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 

645 650 655 660 

Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg He Ala Arg He 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe He Ser Pro Ala Ser Gin Val 

685 690 695 700 

Ala He Cys Leu Ala Leu He Ser Val Gin Leu Leu He Val Ala Ala Trp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 
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725 730 735 740 

Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Ala Tyr Asp Val Leu Leu He Ala 

745 750 755 760 

Leu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe He Gly Phe Thr Met Tyr Thr Thr Cys He He Trp Leu Ala Phe Leu Pro He Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 

805 810 815 820 

Ser Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His He He Leu Phe Gin 

825 830 835 840 

Pro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Pro 

845 850 855 860 

Arg Ala Ser Ala Asn Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 
Gly Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 



Seq XD 19: S688Z1, 6689V, N735D 

-.5 10 15 20 

Met Glu Ser Leu Leu Gly Phe Leu Ala Leu Leu Leu Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

His Gin Lys Gly Gly Pro Ala Glu Glu Cys Gly Pro Val Asn Glu His Arg Gly He Gin 

65 70 75 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Arg Asp Pro His Leu Leu Pro 

85 90 95 100 

Gly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Ser Asp Ala Pro Thr Ala Val Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 

165 170 175 180 

Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 
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245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Thr Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Arg Ala Ala Glu Gly Ala lie Thr lie Glu Leu Ala 

325 330 335 340 

Ser Tyr Pro lie Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Glu Arg Phe His Cys Ser Phe Arg Gin Arg 

365 370 375 380 

Asp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys lie Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 

405 410 415 420 

Pro Asn Thr Thr His Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr Asp Asp Glu 

445 450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly lie Gly Arg Tyr Asn lie Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 

485 490 495 500 

Asp Thr Ser Phe lie Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys lie Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 

565 570 575 580 

He Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr He Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr He Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe Val Phe He Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 



wo 2004/024936 



PCT/US2003/028470 



645 650 655 660 

rhr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg lie Ala Arg lie 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe lie Ser Pro Ala Ser Gin Val 

685 690 695 700 

hlsi lie Cys Leu Ala Leu He Leu Val Gin Leu Leu He Val Ala Ala Trp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 

725 730 735 740 

Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Ala Tyr Asp Val Leu Leu He Ala 

745 750 755 760 

Leu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe He Gly Phe Thr Met Tyr Thr Thr Cys He He Trp Leu Ala Phe Leu Pro He Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 

805 810 815 820 

Ser Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His He He Leu Phe Gin 

825 830 835 840 

Pro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Pro 

845 850 855 860 

Arg Ala Ser Ala Asn Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 
Gly Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 



Seq ID 20: S688I. 

5 10 15 20 

Met Glu Ser Leu Leu Gly Phe Leu Ala Leu Leu Leu Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

His Gin Lys Gly Gly Pro Ala Glu Glu Cys Gly Pro Val Asn Glu His Arg Gly He Gin 

65 70 75 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Ar^ Asp Pro His Leu Leu Pro 

85 90 95 100 

Gly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Ser Asp Ala Pro Thr Ala Val Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 
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165 170 175 180 

Pro Gin lie Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 

245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Thr Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Arg Ala Ala Glu Gly Ala He Thr He Glu Leu Ala 

325 330 335 340 

Ser Tyr Pro He Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Glu Arg Phe His Cys Ser Phe Arg Gin Arg 

365 370 375 • 380 

Asp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys He Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 

405 410 415 420 

Pro Asn Thr Thr His Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr Asp Asp Glu 

445 450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly He Gly Arg Tyr Asn He Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 

485 490 495 500 

Asp Thr Ser Phe He Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys He Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 
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565 570 575 580 

lie Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr lie Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr lie Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe Val Phe lie Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 

645 650 655 660 

Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg lie Ala Arg lie 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe lie Ser Pro Ala Ser Gin Val 

685 690 695 700 

Ala lie Cys Leu Ala Leu lie Leu Gly Gin Leu Leu lie Val Ala Ala Txp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 

725 730 735 740 

Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Ala Tyr Asn Val Leu Leu lie Ala 

745 750 755 760 

Leu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe lie Gly Phe Thr Met Tyr Thr Thr Cys lie lie Trp Leu Ala Phe Leu Pro lie Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 

805 810 815 820 

Ser Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His lie lie Leu Phe Gin 

825 830 835 840 

Pro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Pro 

845 850 ' 855 860 

Arg Ala Ser Ala Asn Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 
Gly Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 



Seq ID 21s S688L, G689V 

5 10 15 20 

Met Glu Ser Leu Leu Gly Phe Leu Ala Leu Leu Leu Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

His Gin Lys Gly Gly Pro Ala Glu Glu Cys Gly Pro Val Asn Glu His Arg Gly He Gin 

65 70 75 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Arg Asp Pro His Leu Leu Pro 
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85 90 95 ICQ 

Gly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130* 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Ser Asp Ala Pro Thr Ala Val Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 

165 170 175 180 

Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala TVrg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 

245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Thr Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Arg Ala Ala Glu Gly Ala He Thr He Glu Leu Ala 

325 330 335 * 340 

Ser Tyr Pro He Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Glu Arg Phe His Cys Ser Phe Arg Gin Arg 

365 370 375 380 

Asp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys He Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 

405 410 415 420 

Pro Asn Thr Thr His Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr Asp Asp Glu 

445 450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly He Gly Arg Tyr Asn He Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 
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485 490 495 500 

\3p Thr Ser Phe lie Pro Trp Ala Ser Pro Ser Ala Gly Pro lieu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys lie Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 

565 570 575 580 

lie Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr lie Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr lie Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe Val Phe lie Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 

645 650 655 660 

Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg lie Ala Arg lie 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe lie Ser Pro Ala Ser Gin Val 

685 690 695 700 

Ala lie Cys Leu Ala Leu lie Leu Val Gin Leu Leu lie Val Ala Ala Trp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg 2^g Glu Val Val Thr Leu Arg 

725 730 735 740 

Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Ala Tyr Asn Val Leu Leu He Ala 

745 750 755 760 

Leu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe He Gly Phe Thr Met Tyr Thr Thr Cys He He Trp Leu Ala Phe Leu Pro He Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 

805 810 815 820 

Ser Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His He He Leu Phe Gin 

825 830 835 840 

Pro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Pro 

845 850 855 860 

Arg Ala Ser Ala Asn Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

855 870 
Gly Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 
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Seq ID 22: S688It, N735D 

5 10 15 20 

Met Glu Ser Leu Leu Gly Phe Leu Ala Leu Leu Leu Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

His Gin Lys Gly Gly Pro Ala Glu Glu Cys Gly Pro Val Asn Glu His Arg Gly lie Gin 

65 70 75 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Arg Asp Pro His Leu Leu Pro 

85 90 95 100 

Gly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Ser Asp Ala Pro Thr Ala Val Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 

165 170 175 180 

Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 

245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Thr Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Arg Ala Ala Glu Gly Ala He Thr He Glu Leu Ala 

325 330 335 340 

Ser Tyr Pro He Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Glu Arg Phe His Cys Ser Phe Arg Gin Arg 

365 370 375 380 

Asp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys He Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 
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405 410 415 420 

Pro Asn Thr Thr His Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr Asp Asp Glu 

450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly He Gly Arg Tyr Asn He Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 

485 490 495 500 

Asp Thr Ser Phe He Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys He Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 5g0 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 

T-, , ™ , 570 575 580 

He Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr He Ala Cys Leu Gly Ala Leu 

585 590 595 qqq 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr He Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

^25 630 635 640 

Phe Val Phe He Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 

650 655 660 

Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg He Ala Arg He 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe He Ser Pro Ala Ser Gin Val 

685 690 695 700 

Ala He Cys Leu Ala Leu He Leu Gly Gin Leu Leu He Val Ala Ala Trp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 

^ , . ''^^ 730 735 740 

Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Ala Tyr Asp Val Leu Leu He Ala 

r ^ '^^^ '^^^ 755 760 

Leu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe He Gly Phe Thr Met Tyr Thr Thr Cys He He Trp Leu Ala Phe Leu Pro He Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 



805 810 815 



820 
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;er Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His He He Leu Phe Gin 

825 830 835 840 

»ro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Pro 

845 850 855 860 

irg Ala Ser Ala Asn Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 
;iy Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 



;eq ID 23: 6689V 

5 10 15 20 

4et Glu Ser Leu Leu Gly Phe Leu Ala Leu Leu Leu Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

d±3 Gin Lys Gly Gly Pro Ala Glu Glu Cys Gly Pro Val Asn Glu His Arg Gly He Gin 

65 70 7S 80 

Arg Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Arg Asp Pro His Leu Leu Pro 

85 90 95 100 

Gly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

Gin Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Ser Asp Ala Pro Thr Ala Val Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 

165 170 175 180 

Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly ' 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 

245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Thr Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Arg Ala Ala Glu Gly Ala He Thr He Glu Leu Ala 
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325 330 335 340 

3er Tyr Pro lie Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Asn Pro Trp Phe Arg Glu Phe Trp Glu Glu Arg Phe His Cys Ser Phe Arg Gin Arg 

365 370 375 380 

\sp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys lie Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 

405 410 415 420 

Pro Asn Thr Thr His Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr Asp Asp Glu 

445 450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly He Gly Arg Tyr Asn He Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 

485 490 495 500 

Asp Thr Ser Phe He Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys He Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 

565 570 575 580 

He Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr He Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr He Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe Val Phe lie Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 

645 650 655 660 

Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg He Ala Arg He 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe He Ser Pro Ala Ser Gin Val 

685 690 ' 695 700 

Ala He Cys Leu Ala Leu He Ser Val Gin Leu Leu He Val Ala Ala Trp Leu Val Val 

705 710 ' 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 

725 73.0 735 740 
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ys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Ala Tyr Asn Val Leu Leu lie Ala 

745 750 755 760 

eu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

he lie Gly Phe Thr Met Tyr Thr Thr Cys lie He Trp Leu Ala Phe Leu Pro He Phe 

785 790 795 800 

yr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 

805 810 815 820 

:er Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His He He Leu Phe Gin 

825 830 835 840 

»ro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Pro 

845 850 855 860 

irg Ala Ser Ala Asn Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 
Jly Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 



Seq XD 24: S688L, 6689V, A733T, N735D 

5 10 15 20 

let Glu Ser Leu Leu Gly Phe Leu Ala Leu Leu Leu Leu Trp Gly Ala Val Ala Glu Gly 

25 30 35 40 

Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Val 

45 50 55 60 

iis Gin Lys Gly Gly Pro Ala Glu Glu Cys Gly Pro Val Asn Glu His Arg Gly He Gin 

65 70 75 80 

?^g Leu Glu Ala Met Leu Phe Ala Leu Asp Arg He Asn Arg Asp Pro His Leu Leu Pro 

85 90 95 100 

Sly Val Arg Leu Gly Ala His He Leu Asp Ser Cys Ser Lys Asp Thr His Ala Leu Glu 

105 110 115 120 

31n Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg Gly Ala Asp Gly Ser Arg His He 

125 130 135 140 

Cys Pro Asp Gly Ser Tyr Ala Thr His Ser Asp Ala Pro Thr Ala Val Thr Gly Val He 

145 150 155 160 

Gly Gly Ser Tyr Ser Asp Val Ser He Gin Val Ala Asn Leu Leu Arg Leu Phe Gin He 

165 170 175 180 

Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr 

185 190 195 200 

Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gin Ala Lys Ala Met Ala Glu He Leu Arg 

205 210 215 220 

Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly 

225 230 235 240 

He Glu Ala Phe Glu Leu Glu Ala Arg Ala Arg Asn He Cys Val Ala Thr Ser Glu Lys 
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245 250 255 260 

Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala Leu Leu Gin Lys 

265 270 275 280 

Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser Glu Asp Ala Arg Glu Leu Leu Ala 

285 290 295 300 

Ala Thr Gin Arg Leu Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu 

305 310 315 320 

Glu Ser Val Val Ala Gly Ser Glu Arg Ala Ala Glu Gly Ala lie Thr lie Glu Leu Ala 

325 330 335 340 

Ser Tyr Pro lie Ser Asp Phe Ala Ser Tyr Phe Gin Ser Leu Asp Pro Trp Asn Asn Ser 

345 350 355 360 

Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Glu Arg Phe His Cys Ser Phe Arg Gin Arg 

365 370 375 380 

Asp Cys Ala Ala His Ser Leu Arg Ala Val Pro Phe Glu Gin Glu Ser Lys lie Met Phe 

385 390 395 400 

Val Val Asn Ala Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 

405 410 415 420 

Pro Asn Thr Thr His Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg Arg Leu Tyr Lys 

425 430 435 440 

Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe Arg Pro Ala Asp Thr Asp Asp Glu 

445 450 455 460 

Val Arg Phe Asp Arg Phe Gly Asp Gly lie Gly Arg Tyr Asn lie Phe Thr Tyr Leu Arg 

465 470 475 480 

Ala Gly Ser Gly Arg Tyr Arg Tyr Gin Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 

485 490 495 500 

Asp Thr Ser Phe He Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Pro Ala Ser Arg Cys 

505 510 515 520 

Ser Glu Pro Cys Leu Gin Asn Glu Val Lys Ser Val Gin Pro Gly Glu Val Cys Cys Trp 

525 530 535 540 

Leu Cys He Pro Cys Gin Pro Tyr Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys 

545 550 555 560 

Gly Leu Gly Tyr Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gin Glu Tyr 

565 570 575 580 

He Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr He Ala Cys Leu Gly Ala Leu 

585 590 595 600 

Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His Asn Ala Thr Pro Val Val Lys Ala 

605 610 615 620 

Ser Gly Arg Glu Leu Cys Tyr He Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr 

625 630 635 640 

Phe Val Phe He Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 

645 650 655 660 
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rhr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg He Ala Arg He 

665 670 675 680 

Phe Gly Gly Ala Arg Glu Gly Ala Gin Arg Pro Arg Phe He Ser Pro Ala Ser Gin Val 

685 690 695 700 

Ala He Cys Leu Ala Leu He Leu Val Gin Leu Leu He Val Ala Ala Trp Leu Val Val 

705 710 715 720 

Glu Ala Pro Gly Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 

725 730 735 740 

Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Thr Tyr Asp Val Leu Leu He Ala 

745 750 755 760 

Leu Cys Thr Leu Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala Lys 

765 770 775 780 

Phe He Gly Phe Thr Met Tyr Thr Thr Cys He He Trp Leu Ala Phe Leu Pro He Phe 

785 790 795 800 

Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin Thr Thr Thr Met Cys Val Ser Val Ser Leu 

805 810 815 820 

Ser Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His He He Leu Phe Gin 

825 830 835 840 

Pro Gin Lys Asn Val Val Ser His Arg Ala Pro Thr Ser Arg Phe Gly Ser Ala Ala Pro 

845 850 855 860 

Arg Ala Ser Ala Asn Leu Gly Gin Gly Ser Gly Ser Gin Phe Val Pro Thr Val Cys Asn 

865 870 
Gly Arg Glu Val Val Asp Ser Thr Thr Ser Ser Leu Ter 
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Se(2 XD 25: N735D 

1 atggaatcac tgcttgggtt tctggcactg ctgctgctgt ggggtgccgt 
51 ggccgagggc ccggccaaga aggtgctgac cctggagggg gacctggtgc 
101 tgggtgggct gttcccagtg caccagaagg gtggcccagc cgaggagtgt 
151 ggacctgtta atgagcaccg aggcatacag cgcctagagg ctatgctttt 
201 tgcactggac cgcatcaacc gcgaccccca cctgctgcct ggtgtgcgct 
251 tgggtgcgca catcctcgac agctgctcca aggatacaca cgccctggag 
301 caggcgctgg actttgtgcg tgcctcactc agtcgtggcg ctgacggctc 
351 acgccacatc tgtcctgatg gctcctatgc cacccacagt gatgctccta 
401 cagctgtcac cggtgtcatt ggtggctcct acagtgatgt ctccatccag 
451 gtggccaatc tcctgcggct gttccagatc ccacagatca gctatgcctc 
501 caccagtgcc aagctgagtg acaagtcccg ttacgattac tttgctcgca 
551 ctgtgccccc agacttcttc caagccaagg ccatggctga gattctccgc 
601 tttttcaact ggacatatgt gtctacggtg gcatctgagg gtgactatgg 
651 tgagacaggc attgaggctt tcgagctcga ggctcgggca cgcaacatct 
701 gcgtggccac ttctgagaag gtgggccgtg ccatgagccg cgctgccttc 
751 gagggcgtgg tgcgagccct gttgcagaaa cccagtgccc gtgtggctgt 
801 gctcttcacc cggtccgagg atgcccgtga gctgcttgca gccacccagc 
851 gcctcaacgc cagcttcaca tgggtggcca gcgacggctg gggggccctg 
901 gagagcgtgg tggcaggcag tgaaagggct gctgagggcg ccat caeca t 
951 tgaactggcc tcctacccca tcagtgactt tgcttcctac ttccagagct 
1001 tggatccctg gaacaacagc agaaaccctt ggttccgtga gttctgggag 
1051 gagaggttcc attgcagctt ccggcagcga gactgtgccg cccactctct 
1101 gcgggccgtg ccctttgaac aggagtcaaa gatcatgttt gtggttaatg 
1151 ccgtctatgc catggcccac gctctacaca acatgcaccg tgccctctgt 
1201 cccaacacca cccacctctg cgatgctatg aggcctgtca atgggcgccg 
1251 cctctacaaa gacttcgtgc tcaatgtcaa gtttgacgcc ccctttcgcc 
1301 cagcagatac tgacgatgag gtccgcttcg accgctttgg tgacggtatt 
1351 ggccgctaca acatcttcac ctatctgcgg gcaggcagtg ggcgctatcg 
1401 ctaccagaag gtaggctact gggcagaagg tctgactctg gacactagct 
1451 tcattccatg ggcctcccca tcagccggac ctcttcctgc ctctcgctgt 
1501 agcgagccct gccttcagaa cgaggtgaag agcgtgcagc cgggcgaggt 
1551 ctgctgttgg ctctgcattc cctgtcagcc ctatgagtac aggctggatg 
1601 agttcacctg cgctgactgt ggcctgggct actggcctaa tgccagtctg 
1651 actggctgct ttgagctgcc ccaggagtac atccgctggg gtgatgcctg 
17 01 ggcggtggga cctgtcacca tcgcctgcct gggtgccctg gcgacactct 
1751 ttgtgttggg tgtctttgtg aggcataatg ccacacccgt ggtcaaggct 
1801 tccggtcggg agctttgcta cattctgctg ggcggtgtct tcctttgcta 
1851 ttgtatgacc ttcgtcttca ttgctaagcc ttccacggcc gtctgtacct 
1901 tgaggcgcct cggtttgggt accgccttct ctgtctgcta ctcagccctc 
1951 ctcaccaaga ccaatcgcat tgctcgcata tttggcgggg cccgggaggg 
2001 tgcccagcgg ccacgcttca tcagtcccgc ctcacaggtg gccatctgct 
2051 tggcacttat ctcgggccag ctgctcattg tcgctgcctg gctggtggtg 
2101 gaggcacctg gcacaggcaa ggagacagcc cctgaacggc gggaagtggt 
2151 gacattgcgc tgtaaccacc gtgacgcgag catgctcggc tctctggcct 
2201 acgatgtgct cctcatcgcg ctctgcacgc tctatgcctt caagacccgc 
2251 aagtgcccgg agaacttcaa cgaagccaag ttcatcggct tcaccatgta 
2301 caccacctgc atcatctggc tggctttcct tcctatcttc tatgtcacct 
2351 ccagtgatta tcgggtgcag accaccacga tgtgcgtgtc cgtcagcctc 
2401 agtggctctg tggtgcttgg ctgcctcttt gcacccaagt tgcacatcat 
2451 ccttttccag ccacagaaga atgtggtgag ccaccgggca cctaccagcc 
2501 gctttggcag cgctgccccc agggccagcg ccaaccttgg tcaagggtct 
2551 ggatcccagt ttgttcccac tgtttgcaac ggccgtgagg tggtggactc 
2601 aacaacgtcg tcgctttga // 
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Sed ZD 26ft G689V, N735D 

1 atggaatcac tgcttgggtt tctggcactg ctgctgctgt ggggtgccgt 
51 ggccgagggc ccggccaaga aggtgctgac cctggagggg gacctggtgc 
101 tgggtgggct gttcccagtg caccagaagg gtggcccagc cgaggagtgt 
151 ggacctgtta atgagcaccg aggcatacag cgcctagagg ctatgctttt 
201 tgcactggac cgcatcaacc gcgaccccca cctgctgcct ggtgtgcgct 
251 tgggtgcgca catcctcgac agctgctcca aggatacaca cgccctggag 
3 01 caggcgctgg actttgtgcg tgcctcactc agtcgtggcg ctgacggctc 
351 acgccacatc tgtcctgatg gctcctatgc cacccacagt gatgctccta 
401 cagctgtcac cggtgtcatt ggtggctcct acagtgatgt ctccatccag 
451 gtggccaatc tcctgcggct gttccagatc ccacagatca gctatgcctc 
501 caccagtgcc aagctgagtg acaagtcccg ttacgattac tttgctcgca 
551 ctgtgccccc agacttcttc caagccaagg ccatggctga gattctccgc 
601 tttttcaact ggacatatgt gtctacggtg gcatctgagg gtgactatgg 
651 tgagacaggc attgaggctt tcgagctcga ggctcgggca cgcaacatct 
701 gcgtggccac ttctgagaag gtgggccgtg ccatgagccg cgctgccttc 
751 gagggcgtgg tgcgagccct gttgcagaaa cccagtgccc gtgtggctgt 
801 gctcttcacc cggtccgagg atgcccgtga gctgcttgca gccacccagc 
851 gcctcaacgc cagcttcaca tgggtggcca gcgacggctg gggggccctg 
901 gagagcgtgg tggcaggcag tgaaagggct gctgagggcg ccatcaccat 
951 tgaactggcc tcctacccca tcagtgactt tgcttcctac ttccagagct 
1001 tggatccctg gaacaacagc agaaaccctt ggttccgtga gttctgggag 
1051 gagaggttcc attgcagctt ccggcagcga gactgtgccg cccactctct 
1101 gcgggccgtg ccctttgaac aggagtcaaa gatcatgttt gtggttaatg 
1151 ccgtctatgc catggcccac gctctacaca acatgcaccg tgccctctgt 
1201 cccaacacca cccacctctg cgatgctatg aggcctgtca atgggcgccg 
1251 cctctacaaa gacttcgtgc tcaatgtcaa gtttgacgcc ccctttcgcc 
1301 cagcagatac tgacgatgag gtccgcttcg accgctttgg tgacggtatt 
1351 ggccgctaca acatcttcac ctatctgcgg gcaggcagtg ggcgctatcg 
1401 ctaccagaag gtaggctact gggcagaagg tctgactctg gacactagct 
1451 tcattccatg ggcctcccca tcagccggac ctcttcctgc ctctcgctgt 
1501 agcgagccct gccttcagaa cgaggtgaag agcgtgcagc cgggcgaggt 
1551 ctgctgttgg ctctgcattc cctgtcagcc ctatgagtac aggctggatg 
1601 agttcacctg cgctgactgt ggcctgggct actggcctaa tgccagtctg 
1651 actggctgct ttgagctgcc ccaggagtac atccgctggg gtgatgcctg 
1701 ggcggtggga cctgtcacca tcgcctgcct gggtgccctg gcgacactct 
1751 ttgtgttggg tgtctttgtg aggcataatg ccacacccgt ggtcaaggct 
1801 tccggtcggg agctttgcta cattctgctg ggcggtgtct tcctttgcta 
1851 ttgtatgacc ttcgtcttca ttgctaagcc ttccacggcc gtctgtacct 
1901 tgaggcgcct cggtttgggt accgccttct ctgtctgcta ctcagccctc 
1951 ctcaccaaga ccaatcgcat tgctcgcata tttggcgggg cccgggaggg 
2001 tgcccagcgg ccacgcttca tcagtcccgc ctcacaggtg gccatctgct 
2051 tggcacttat ctcggtccag ctgctcattg tcgctgcctg gctggtggtg 
2101 gaggcacctg gcacaggcaa ggagacagcc cctgaacggc gggaagtggt 
2151 gacattgcgc tgtaaccacc gtgacgcgag catgctcggc tctctggcct 
2201 acgatgtgct cctcatcgcg ctctgcacgc tctatgcctt caagacccgc 
2251 aagtgcccgg agaacttcaa cgaagccaag ttcatcggct tcaccatgta 
2301 caccacctgc atcatctggc tggctttcct tcctatcttc tatgtcacct 
2351 ccagtgatta tcgggtgcag accaccacga tgtgcgtgtc cgtcagcctc 
2401 agtggctctg tggtgcttgg ctgcctcttt gcacccaagt tgcacatcat 
2451 ccttttccag ccacagaaga atgtggtgag ccaccgggca cctaccagcc 
2501 gctttggcag cgctgccccc agggccagcg ccaaccttgg tcaagggtct 
2551 ggatcccagt ttgttcccac tgtttgcaac ggccgtgagg tggtggactc 
2601 aacaacgtcg tcgctttga // 
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Seq XD 27s S688L, 6689V, N735D 

1 atggaatcac tgcttgggtt tctggcactg ctgctgctgt ggggtgccgt 
51 ggccgagggc ccggccaaga aggtgctgac cctggagggg gacctggtgc 
101 tgggtgggct gttcccagtg caccagaagg gtggcccagc cgaggagtgt 
151 ggacctgtta atgagcaccg aggcatacag cgcctagagg ctatgctttt 
201 tgcactggac cgcatcaacc gcgaccccca cctgctgcct ggtgtgcgct 
251 tgggtgcgca catcctcgac agctgctcca aggatacaca cgccctggag 
301 caggcgctgg actttgtgcg tgcctcactc agtcgtggcg ctgacggctc 
351 acgccacatc tgtcctgatg gctcctatgc cacccacagt gatgctccta 
401 cagctgtcac cggtgtcatt ggtggctcct acagtgatgt ctccatccag 
451 gtggccaatc tcctgcggct gttccagatc ccacagatca gctatgcctc 
501 caccagtgcc aagctgagtg acaagtcccg ttacgattac tttgctcgca 
551 ctgtgccccc agacttcttc caagccaagg ccatggctga gattctccgc 
601 tttttcaact ggacatatgt gtctacggtg gcatctgagg gtgactatgg 
651 tgagacaggc attgaggctt tcgagctcga ggctcgggca cgcaacatct 
701 gcgtggccac ttctgagaag gtgggccgtg ccatgagccg cgctgccttc 
751 gagggcgtgg tgcgagccct gttgcagaaa cccagtgccc gtgtggctgt 
801 gctcttcacc cggtccgagg atgcccgtga gctgcttgca gccacccagc 
851 gcctcaacgc cagcttcaca tgggtggcca gcgacggctg gggggccctg 
901 gagagcgtgg tggcaggcag tgaaagggct gctgagggcg ccatcaccat 
951 tgaactggcc tcctacccca tcagtgactt tgcttcctac ttccagagct 
1001 tggatccctg gaacaacagc agaaaccctt ggttccgtga gttctgggag 
1051 gagaggttcc attgcagctt ccggcagcga gactgtgccg cccactctct 
1101 gcgggccgtg ccctttgaac aggagtcaaa gatcatgttt gtggttaatg 
1151 ccgtctatgc catggcccac gctctacaca acatgcaccg tgccctctgt 
1201 cccaacacca cccacctctg cgatgctatg aggcctgtca atgggcgccg 
1251 cctctacaaa gacttcgtgc tcaatgtcaa gtttgacgcc ccctttcgcc 
1301 cagcagatac tgacgatgag gtccgcttcg accgctttgg tgacggtatt 
1351 ggccgctaca acatcttcac ctatctgcgg gcaggcagtg ggcgctatcg 
1401 ctaccagaag gtaggctact gggcagaagg tctgactctg gacactagct 
1451 tcattccatg ggcctcccca tcagccggac ctcttcctgc ctctcgctgt 
1501 agcgagccct gccttcagaa cgaggtgaag agcgtgcagc cgggcgaggt 
1551 ctgctgttgg ctctgcattc cctgtcagcc ctatgagtac aggctggatg 
1601 agttcacctg cgctgactgt ggcctgggct actggcctaa tgccagtctg 
1651 actggctgct ttgagctgcc ccaggagtac atccgctggg gtgatgcctg 
1701 ggcggtggga cctgtcacca tcgcctgcct gggtgccctg gcgacactct 
1751 ttgtgttggg tgtctttgtg aggcataatg ccacacccgt ggtcaaggct 
1801 tccggtcggg agctttgcta cattctgctg ggcggtgtct tcctttgcta 
1851 ttgtatgacc ttcgtcttca ttgctaagcc ttccacggcc gtctgtacct 
1901 tgaggcgcct cggtttgggt accgccttct ctgtctgcta ctcagccctc 
1951 ctcaccaaga ccaatcgcat tgctcgcata tttggcgggg cccgggaggg 
2001 tgcccagcgg ccacgcttca tcagtcccgc ctcacaggtg gccatctgct 
2051 tggcacttat cttggtccag ctgctcattg tcgctgcctg gctggtggtg 
2101 gaggcacctg gcacaggcaa ggagacagcc cctgaacggc gggaagtggt 
2151 gacattgcgc tgtaaccacc gtgacgcgag catgctcggc tctctggcct 
2201 acgatgtgct cctcatcgcg ctctgcacgc tctatgcctt caagacccgc 
2251 aagtgcccgg agaacttcaa cgaagccaag ttcatcggct tcaccatgta 
2301 caccacctgc atcatctggc tggctttcct tcctatcttc tatgtcacct 
2351 ccagtgatta tcgggtgcag accaccacga tgtgcgtgtc cgtcagcctc 
2401 agtggctctg tggtgcttgg ctgcctcttt gcacccaagt tgcacatcat 
2451 ccttttccag ccacagaaga atgtggtgag ccaccgggca cctaccagcc 
2501 gctttggcag cgctgccccc agggccagcg ccaaccttgg tcaagggtct 
2551 ggatcccagt ttgttcccac tgtttgcaac ggccgtgagg tggtggactc 
2601 aacaacgtcg tcgctttga // 



wo 2004/024936 



PCTAJS2003/028470 



Seq XD 28: S688I1 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 
2601 



atggaatcac 
ggccgagggc 
tgggtgggct 
ggacctgtta 
tgcactggac 
tgggtgcgca 
caggcgctgg 
acgccacatc 
cagctgtcac 
gtggccaatc 
caccagtgcc 
ctgtgccccc 
tttttcaact 
tgagacaggc 
gcgtggccac 
gagggcgtgg 
gctcttcacc 
gcctcaacgc 
gagagcgtgg 
tgaactggcc 
tggatccctg 
gagaggttcc 
gcgggccgtg 
ccgtctatgc 
cccaacacca 
cctctacaaa 
cagcagatac 
ggccgctaca 
ctaccagaag 
tcattccatg 
agcgagccct 
ctgctgttgg 
agttcacctg 
actggctgct 
ggcggtggga 
ttgtgttggg 
tccggtcggg 
ttgtatgacc 
tgaggcgcct 
ctcaccaaga 
tgcccagcgg 
tggcacttat 
gaggcacctg 
gacattgcgc 
acaatgtgct 
aagtgcccgg 
caccacctgc 
ccagtgatta 
agtggctctg 
ccttttccag 
gctttggcag 
ggatcccagt 
aacaacgtcg 



tgcttgggtt 
ccggccaaga 
gttcccagtg 
atgagcaccg 
cgcatcaacc 
catcctcgac 
actttgtgcg 
tgtcctgatg 
cggtgtcatt 
tcctgcggct 
aagctgagtg 
agacttcttc 
ggacatatgt 
attgaggctt 
ttctgagaag 
tgcgagccct 
cggtccgagg 
cagcttcaca 
tggcaggcag 
tcctacccca 
gaacaacagc 
attgcagctt 
ccctttgaac 
catggcccac 
cccacctctg 
gacttcgtgc 
tgacgatgag 
acatcttcac 
gtaggctact 
ggcctcccca 
gccttcagaa 
ctctgcattc 
cgctgactgt 
ttgagctgcc 
cctgtcacca 
tgtctttgtg 
agctttgcta 
ttcgtcttca 
cggtttgggt 
ccaatcgcat 
ccacgcttca 
cttgggccag 
gcacaggcaa 
tgtaaccacc 
cctcatcgcg 
agaacttcaa 
atcatctggc 
tcgggtgcag 
tggtgcttgg 
ccacagaaga 
cgctgccccc 
ttgttcccac 
tcgctttga 



tctggcactg 
aggtgctgac 
caccagaagg 
aggcatacag 
gcgaccccca 
agctgctcca 
tgcctcactc 
gctcctatgc 
ggtggctcct 
gttccagatc 
acaagtcccg 
caagccaagg 
gtctacggtg 
tcgagctcga 
gtgggccgtg 
gttgcagaaa 
atgcccgtga 
tgggtggcca 
tgaaagggct 
tcagtgactt 
agaaaccctt 
ccggcagcga 
aggagtcaaa 
gctctacaca 
cgatgctatg 
tcaatgtcaa 
gtccgcttcg 
ctatctgcgg 
gggcagaagg 
tcagccggac 
cgaggtgaag 
cctgtcagcc 
ggcctgggct 
ccaggagtac 
tcgcctgcct 
aggcataatg 
cattctgctg 
ttgctaagcc 
accgccttct 
tgctcgcata 
tcagtcccgc 
ctgctcattg 
ggagacagcc 
gtgacgcgag 
ctctgcacgc 
cgaagccaag 
tggctttcct 
accaccacga 
ctgcctcttt 
atgtggtgag 
agggccagcg 
tgtttgcaac 
// 



ctgctgctgt 
cctggagggg 
gtggcccagc 
cgcctagagg 
cctgctgcct 
aggatacaca 
agtcgtggcg 
cacccacagt 
acagtgatgt 
ccacagatca 
ttacgattac 
ccatggctga 
gcatctgagg 
ggctcgggca 
ccatgagccg 
cccagtgccc 
gctgcttgca 
gcgacggctg 
gctgagggcg 
tgcttcctac 
ggttccgtga 
gactgtgccg 
gatcatgttt 
acatgcaccg 
aggcctgtca 
gtttgacgcc 
accgctttgg 
gcaggcagtg 
tctgactctg 
ctcttcctgc 
agcgtgcagc 
ctatgagtac 
actggcctaa 
atccgctggg 
gggtgccctg 
ccacacccgt 
ggcggtgtct 
ttccacggcc 
ctgtctgcta 
tttggcgggg 
ctcacaggtg 
tcgctgcctg 
cctgaacggc 
catgctcggc 
tctatgcctt 
ttcatcggct 
tcctatcttc 
tgtgcgtgtc 
gcacccaagt 
ccaccgggca 
ccaaccttgg 
ggccgtgagg 



ggggtgccgt 

gacctggtgc 
cgaggagtgt 
ctatgctttt 
ggtgtgcgct 
cgccctggag 
ctgacggctc 
gatgctccta 
ctccatccag 
gctatgcctc 
tttgctcgca 
gattctccgc 
gtgactatgg 
cgcaacatct 
cgctgccttc 
gtgtggctgt 
gccacccagc 

gggggccctg 
ccatcaccat 
ttccagagct 
gttctgggag 
cccactctct 
gtggttaatg 
tgccctctgt 
atgggcgccg 
ccctttcgcc 
tgacggtatt 
ggcgctatcg 
gacactagct 
ctctcgctgt 
cgggcgaggt 
aggctggatg 
tgccagtctg 
gtgatgcctg 
gcgacactct 
ggtcaaggct 
tcctttgcta 
gtctgtacct 
ctcagccctc 
cccgggaggg 
gccatctgct 
gctggtggtg 
gggaagtggt 
tctctggcct 
caagacccgc 
tcaccatgta 
tatgtcacct 
cgtcagcctc 
tgcacatcat 
cctaccagcc 
tcaagggtct 
tggtggactc 



wo 2004/024936 



PCTAJS2003/028470 



Seq XX> 29 s S688I1, G689V 

1 atggaatcac tgcttgggtt tctggcactg ctgctgctgt ggggtgccgt 
51 ggccgagggc ccggccaaga aggtgctgac cctggagggg gacctggtgc 
101 tgggtgggct gttcccagtg caccagaagg gtggcccagc cgaggagtgt 
151 ggacctgtta atgagcaccg aggcatacag cgcctagagg ctatgctttt 
201 tgcactggac cgcatcaacc gcgaccccca cctgctgcct ggtgtgcgct 
251 tgggtgcgca catcctcgac agctgctcca aggatacaca cgccctggag 
301 caggcgctgg actttgtgcg tgcctcactc agtcgtggcg ctgacggctc 
3 51 acgccacatc tgtcctgatg gctcctatgc cacccacagt gatgctccta 
401 cagctgtcac cggtgtcatt ggtggctcct acagtgatgt ctccatccag 
451 gtggccaatc tcctgcggct gttccagatc ccacagatca gctatgcctc 
501 caccagtgcc aagctgagtg acaagtcccg ttacgattac tttgctcgca 
551 ctgtgccccc agacttcttc caagccaagg ccatggctga gattctccgc 
601 tttttcaact ggacatatgt gtctacggtg gcatctgagg gtgactatgg 
651 tgagacaggc attgaggctt tcgagctcga ggctcgggca cgcaacatct 
701 gcgtggccac ttctgagaag gtgggccgtg ccatgagccg cgctgccttc 
751 gagggcgtgg tgcgagccct gttgcagaaa cccagtgccc gtgtggctgt 
801 gctcttcacc cggtccgagg atgcccgtga gctgcttgca gccacccagc 
851 gcctcaacgc cagcttcaca tgggtggcca gcgacggctg gggggccctg 
901 gagagcgtgg tggcaggcag tgaaagggct gctgagggcg ccatcaccat 
951 tgaactggcc tcctacccca tcagtgactt tgcttcctac ttccagagct 
1001 tggatccctg gaacaacagc agaaaccctt ggttccgtga gttctgggag 
1051 gagaggttcc attgcagctt ccggcagcga gactgtgccg cccactctct 
1101 gcgggccgtg ccctttgaac aggagtcaaa gatcatgttt gtggttaatg 
1151 ccgtctatgc catggcccac gctctacaca acatgcaccg tgccctctgt 
1201 cccaacacca cccacctctg cgatgctatg aggcctgtca atgggcgccg 
1251 cctctacaaa gacttcgtgc tcaatgtcaa gtttgacgcc ccctttcgcc 
1301 cagcagatac tgacgatgag gtccgcttcg accgctttgg tgacggtatt 
1351 ggccgctaca acatcttcac ctatctgcgg gcaggcagtg ggcgctatcg 
1401 ctaccagaag gtaggctact gggcagaagg tctgactctg gacactagct 
1451 tcattccatg ggcctcccca tcagccggac ctcttcctgc ctctcgctgt 
1501 agcgagccct gccttcagaa cgaggtgaag agcgtgcagc cgggcgaggt 
1551 ctgctgttgg ctctgcattc cctgtcagcc ctatgagtac aggctggatg 
1601 agttcacctg cgctgactgt ggcctgggct actggcctaa tgccagtctg 
1651 actggctgct ttgagctgcc ccaggagtac atccgctggg gtgatgcctg 
17 01 ggcggtggga cctgtcacca tcgcctgcct gggtgccctg gcgacactct 
1751 ttgtgttggg tgtctttgtg aggcataatg ccacacccgt ggtcaaggct 
1801 tccggtcggg agctttgcta cattctgctg ggcggtgtct tcctttgcta 
1851 ttgtatgacc ttcgtcttca ttgctaagcc ttccacggcc gtctgtacct 
1901 tgaggcgcct cggtttgggt accgccttct ctgtctgcta ctcagccctc 
1951 ctcaccaaga ccaatcgcat tgctcgcata tttggcgggg cccgggaggg 
2001 tgcccagcgg ccacgcttca tcagtcccgc ctcacaggtg gccatctgct 
2051 tggcacttat cttggtccag ctgctcattg tcgctgcctg gctggtggtg 
2101 gaggcacctg gcacaggcaa ggagacagcc cctgaacggc gggaagtggt 
2151 gacattgcgc tgtaaccacc gtgacgcgag catgctcggc tctctggcct 
22 01 acaatgtgct cctcatcgcg ctctgcacgc tctatgcctt caagacccgc 
2251 aagtgcccgg agaacttcaa cgaagccaag ttcatcggct tcaccatgta 
2301 caccacctgc atcatctggc tggctttcct tcctatcttc tatgtcacct 
2351 ccagtgatta tcgggtgcag accaccacga tgtgcgtgtc cgtcagcctc 
2401 agtggctctg tggtgcttgg ctgcctcttt gcacccaagt tgcacatcat 
2451 ccttttccag ccacagaaga atgtggtgag ccaccgggca cctaccagcc 
2501 gctttggcag cgctgccccc agggccagcg ccaaccttgg tcaagggtct 
2551 ggatcccagt ttgttcccac tgtttgcaac ggccgtgagg tggtggactc 
2601 aacaacgtcg tcgctttga // 



wo 2004/024936 



PCT/US2003/028470 



Seq XD 30: S688I«, Xr735D 

1 atggaatcac tgcttgggtt tctggcactg ctgctgctgt ggggtgccgt 
51 ggccgagggc ccggccaaga aggtgctgac cctggagggg gacctggtgc 
101 tgggtgggct gttcccagtg caccagaagg gtggcccagc cgaggagtgt 
151 ggacctgtta atgagcaccg aggcatacag cgcctagagg ctatgctttt 
201 tgcactggac cgcatcaacc gcgaccccca cctgctgcct ggtgtgcgct 
251 tgggtgcgca catcctcgac agctgctcca aggatacaca cgccctggag 
301 caggcgctgg actttgtgcg tgcctcactc agtcgtggcg ctgacggctc 
351 acgccacatc tgtcctgatg gctcctatgc cacccacagt gatgctccta 
401 cagctgtcac cggtgtcatt ggtggctcct acagtgatgt ctccatccag 
451 gtggccaatc tcctgcggct gttccagatc ccacagatca gctatgcctc 
501 caccagtgcc aagctgagtg acaagtcccg ttacgattac tttgctcgca 
551 ctgtgccccc agacttcttc caagccaagg ccatggctga gattctccgc 
601 tttttcaact ggacatatgt gtctacggtg gcatctgagg gtgactatgg 
651 tgagacaggc attgaggctt tcgagctcga ggctcgggca cgcaacatct 
701 gcgtggccac ttctgagaag gtgggccgtg ccatgagccg cgctgccttc 
751 gagggcgtgg tgcgagccct gttgcagaaa cccagtgccc gtgtggctgt 
801 gctcttcacc cggtccgagg atgcccgtga gctgcttgca gccacccagc 
851 gcctcaacgc cagcttcaca tgggtggcca gcgacggctg gggggccctg 
901 gagagcgtgg tggcaggcag tgaaagggct gctgagggcg ccatcaccat 
951 tgaactggcc tcctacccca tcagtgactt tgcttcctac ttccagagct 
1001 tggatccctg gaacaacagc agaaaccctt ggttccgtga gttctgggag 
1051 gagaggttcc attgcagctt ccggcagcga gactgtgccg cccactctct 
1101 gcgggccgtg ccctttgaac aggagtcaaa gatcatgttt gtggttaatg 
1151 ccgtctatgc catggcccac gctctacaca acatgcaccg tgccctctgt 
1201 cccaacacca cccacctctg cgatgctatg aggcctgtca atgggcgccg 
1251 cctctacaaa gacttcgtgc tcaatgtcaa gtttgacgcc ccctttcgcc 
1301 cagcagatac tgacgatgag gtccgcttcg accgctttgg tgacggtatt 
1351 ggccgctaca acatcttcac ctatctgcgg gcaggcagtg ggcgctatcg 
1401 ctaccagaag gtaggctact gggcagaagg tctgactctg gacactagct 
1451 tcattccatg ggcctcccca tcagccggac ctcttcctgc ctctcgctgt 
1501 agcgagccct gccttcagaa cgaggtgaag agcgtgcagc cgggcgaggt 
1551 ctgctgttgg ctctgcattc cctgtcagcc ctatgagtac aggctggatg 
1601 agttcacctg cgctgactgt ggcctgggct actggcctaa tgccagtctg 
1651 actggctgct ttgagctgcc ccaggagtac atccgctggg gtgatgcctg 
1701 ggcggtggga cctgtcacca tcgcctgcct gggtgccctg gcgacactct 
1751 ttgtgttggg tgtctttgtg aggcataatg ccacacccgt ggtcaaggct 
1801 tccggtcggg agctttgcta cattctgctg ggcggtgtct tcctttgcta 
1851 ttgtatgacc ttcgtcttca ttgctaagcc ttccacggcc gtctgtacct 
1901 tgaggcgcct cggtttgggt accgccttct ctgtctgcta ctcagccctc 
1951 ctcaccaaga ccaatcgcat tgctcgcata tttggcgggg cccgggaggg 
2001 tgcccagcgg ccacgcttca tcagtcccgc ctcacaggtg gccatctgct 
2051 tggcacttat cttgggccag ctgctcattg tcgctgcctg gctggtggtg 
2101 gaggcacctg gcacaggcaa ggagacagcc cctgaacggc gggaagtggt 
2151 gacattgcgc tgtaaccacc gtgacgcgag catgctcggc tctctggcct 
2201 acgatgtgct cctcatcgcg ctctgcacgc tctatgcctt caagacccgc 
2251 aagtgcccgg agaacttcaa cgaagccaag ttcatcggct tcaccatgta 
2301 caccacctgc atcatctggc tggctttcct tcctatcttc tatgtcacct 
2351 ccagtgatta tcgggtgcag accaccacga tgtgcgtgtc cgtcagcctc 
2401 agtggctctg tggtgcttgg ctgcctcttt gcacccaagt tgcacatcat 
2451 ccttttccag ccacagaaga atgtggtgag ccaccgggca cctaccagcc 
2501 gctttggcag cgctgccccc agggccagcg ccaaccttgg tcaagggtct 
2551 ggatcccagt ttgttcccac tgtttgcaac ggccgtgagg tggtggactc 
2601 aacaacgtcg tcgctttga // 



wo 2004/024936 



PCT/US2003/028470 



Seq XD 31s G698V 

1 atggaatcac tgcttgggtt tctggcactg ctgctgctgt ggggtgccgt 
51 ggccgagggc ccggccaaga aggtgctgac cctggagggg gacctggtgc 
101 tgggtgggct gttcccagtg caccagaagg gtggcccagc cgaggagtgt 
151 ggacctgtta atgagcaccg aggcatacag cgcctagagg ctatgctttt 
201 tgcactggac cgcatcaacc gcgaccccca cctgctgcct ggtgtgcgct 
251 tgggtgcgca catcctcgac agctgctcca aggatacaca cgccctggag 
3 01 caggcgctgg actttgtgcg tgcctcactc agtcgtggcg ctgacggctc 
351 acgccacatc tgtcctgatg gctcctatgc cacccacagt gatgctccta 
401 cagctgtcac cggtgtcatt ggtggctcct acagtgatgt ctccatccag 
451 gtggccaatc tcctgcggct gttccagatc ccacagatca gctatgcctc 
501 caccagtgcc aagctgagtg acaagtcccg ttacgattac tttgctcgca 
551 ctgtgccccc agacttcttc caagccaagg ccatggctga gattctccgc 
601 tttttcaact ggacatatgt gtctacggtg gcatctgagg gtgactatgg 
651 tgagacaggc attgaggctt tcgagctcga ggctcgggca cgcaacatct 
701 gcgtggccac ttctgagaag gtgggccgtg ccatgagccg cgctgccttc 
751 gagggcgtgg tgcgagccct gttgcagaaa cccagtgccc gtgtggctgt 
801 gctcttcacc cggtccgagg atgcccgtga gctgcttgca gccacccagc 
851 gcctcaacgc cagcttcaca tgggtggcca gcgacggctg gggggccctg 
901 gagagcgtgg tggcaggcag tgaaagggct gctgagggcg ccatcaccat 
951 tgaactggcc tcctacccca tcagtgactt tgcttcctac ttccagagct 
1001 tggatccctg gaacaacagc agaaaccctt ggttccgtga gttctgggag 
1051 gagaggttcc attgcagctt ccggcagcga gactgtgccg cccactctct 
1101 gcgggccgtg ccctttgaac aggagtcaaa gatcatgttt gtggttaatg 
1151 ccgtctatgc catggcccac gctctacaca acatgcaccg tgccctctgt 
1201 cccaacacca cccacctctg cgatgctatg aggcctgtca atgggcgccg 
1251 cctctacaaa gacttcgtgc tcaatgtcaa gtttgacgcc ccctttcgcc 
1301 cagcagatac tgacgatgag gtccgcttcg accgctttgg tgacggtatt 
1351 ggccgctaca acatcttcac ctatctgcgg gcaggcagtg ggcgctatcg 
1401 ctaccagaag gtaggctact gggcagaagg tctgactctg gacactagct 
1451 tcattccatg ggcctcccca tcagccggac ctcttcctgc ctctcgctgt 
1501 agcgagccct gccttcagaa cgaggtgaag agcgtgcagc cgggcgaggt 
1551 ctgctgttgg ctctgcattc cctgtcagcc ctatgagtac aggctggatg 
1601 agttcacctg cgctgactgt ggcctgggct actggcctaa tgccagtctg 
1651 actggctgct ttgagctgcc ccaggagtac atccgctggg gtgatgcctg 
1701 ggcggtggga cctgtcacca tcgcctgcct gggtgccctg gcgacactct 
1751 ttgtgttggg tgtctttgtg aggcataatg ccacacccgt ggtcaaggct 
1801 tccggtcggg agctttgcta cattctgctg ggcggtgtct tcctttgcta 
1851 ttgtatgacc ttcgtcttca ttgctaagcc ttccacggcc gtctgtacct 
1901 tgaggcgcct cggtttgggt accgccttct ctgtctgcta ctcagccctc 
1951 ctcaccaaga ccaatcgcat tgctcgcata tttggcgggg cccgggaggg 
2001 tgcccagcgg ccacgcttca tcagtcccgc ctcacaggtg gccatctgct 
2051 tggcacttat ctcggtccag ctgctcattg tcgctgcctg gctggtggtg 
2101 gaggcacctg gcacaggcaa ggagacagcc cctgaacggc gggaagtggt 
2151 gacattgcgc tgtaaccacc gtgacgcgag catgctcggc tctctggcct 
2201 acaatgtgct cctcatcgcg ctctgcacgc tctatgcctt caagacccgc 
2251 aagtgcccgg agaacttcaa cgaagccaag ttcatcggct tcaccatgta 
2301 caccacctgc atcatctggc tggctttcct tcctatcttc tatgtcacct 
2351 ccagtgatta tcgggtgcag accaccacga tgtgcgtgtc cgtcagcctc 
2401 agtggctctg tggtgcttgg ctgcctcttt gcacccaagt tgcacatcat 
2451 ccttttccag ccacagaaga atgtggtgag ccaccgggca cctaccagcc 
2501 gctttggcag cgctgccccc agggccagcg ccaaccttgg tcaagggtct 
2551 ggatcccagt ttgttcccac tgtttgcaac ggccgtgagg tggtggactc 
2601 aacaacgtcg tcgctttga // 



wo 2004/024936 



PCT/US2003/028470 



Seq ID 32 s S688I., G689V, A733T, N735D 

1 atggaatcac tgcttgggtt tctggcactg ctgctgctgt ggggtgccgt 
51 ggccgagggc ccggccaaga aggtgctgac cctggagggg gacctggtgc 
101 tgggtgggct gttcccagtg caccagaagg gtggcccagc cgaggagtgt 
151 ggacctgtta atgagcaccg aggcatacag cgcctagagg ctatgctttt 
201 tgcactggac cgcatcaacc gcgaccccca cctgctgcct ggtgtgcgct 
251 tgggtgcgca catcctcgac agctgctcca aggatacaca cgccctggag 
301 caggcgctgg actttgtgcg tgcctcactc agtcgtggcg ctgacggctc 
351 acgccacatc tgtcctgatg gctcctatgc cacccacagt gatgctccta 
401 cagctgtcac cggtgtcatt ggtggctcct acagtgatgt ctccatccag 
451 gtggccaatc tcctgcggct gttccagatc ccacagatca gctatgcctc 
501 caccagtgcc aagctgagtg acaagtcccg ttacgattac tttgctcgca 
551 ctgtgccccc agacttcttc caagccaagg ccatggctga gattctccgc 
601 tttttcaact ggacatatgt gtctacggtg gcatctgagg gtgactatgg 
651 tgagacaggc attgaggctt tcgagctcga ggctcgggca cgcaacatct 
701 gcgtggccac ttctgagaag gtgggccgtg ccatgagccg cgctgccttc 
751 gagggcgtgg tgcgagccct gttgcagaaa cccagtgccc gtgtggctgt 
801 gctcttcacc cggtccgagg atgcccgtga gctgcttgca gccacccagc 
851 gcctcaacgc cagcttcaca tgggtggcca gcgacggctg gggggccctg 
901 gagagcgtgg tggcaggcag tgaaagggct gctgagggcg ccatcaccat 
951 tgaactggcc tcctacccca tcagtgactt tgcttcctac ttccagagct 
1001 tggatccctg gaacaacagc agaaaccctt ggttccgtga gttctgggag 
1051 gagaggttcc attgcagctt ccggcagcga gactgtgccg cccactctct 
1101 gcgggccgtg ccctttgaac aggagtcaaa gatcatgttt gtggttaatg 
1151 ccgtctatgc catggcccac gctctacaca acatgcaccg tgccctctgt 
1201 cccaacacca cccacctctg cgatgctatg aggcctgtca atgggcgccg 
1251 cctctacaaa gacttcgtgc tcaatgtcaa gtttgacgcc ccctttcgcc 
1301 cagcagatac tgacgatgag gtccgcttcg accgctttgg tgacggtatt 
1351 ggccgctaca acatcttcac ctatctgcgg gcaggcagtg ggcgctatcg 
1401 ctaccagaag gtaggctact gggcagaagg tctgactctg gacactagct 
1451 tcattccatg ggcctcccca tcagccggac ctcttcctgc ctctcgctgt 
1501 agcgagccct gccttcagaa cgaggtgaag agcgtgcagc cgggcgaggt 
1551 ctgctgttgg ctctgcattc cctgtcagcc ctatgagtac aggctggatg 
1601 agttcacctg cgctgactgt ggcctgggct actggcctaa tgccagtctg 
1651 actggctgct ttgagctgcc ccaggagtac atccgctggg gtgatgcctg 
1701 ggcggtggga cctgtcacca tcgcctgcct gggtgccctg gcgacactct 
1751 ttgtgttggg tgtctttgtg aggcataatg ccacacccgt ggtcaaggct 
1801 tccggtcggg agctttgcta cattctgctg ggcggtgtct tcctttgcta 
1851 ttgtatgacc ttcgtcttca ttgctaagcc ttccacggcc gtctgtacct 
1901 tgaggcgcct cggtttgggt accgccttct ctgtctgcta ctcagccctc 
1951 ctcaccaaga ccaatcgcat tgctcgcata tttggcgggg cccgggaggg 
2001 tgcccagcgg ccacgcttca tcagtcccgc ctcacaggtg gccatctgct 
2051 tggcacttat cttggtccag ctgctcattg tcgctgcctg gctggtggtg 
2101 gaggcacctg gcacaggcaa ggagacagcc cctgaacggc gggaagtggt 
2151 gacattgcgc tgtaaccacc gtgacgcgag catgctcggc tctctgacct 
2201 acgatgtgct cctcatcgcg ctctgcacgc tctatgcctt caagacccgc 
2251 aagtgcccgg agaacttcaa cgaagccaag ttcatcggct tcaccatgta 
2301 caccacctgc atcatctggc tggctttcct tcctatcttc tatgtcacct 
2351 ccagtgatta tcgggtgcag accaccacga tgtgcgtgtc cgtcagcctc 
2401 agtggctctg tggtgcttgg ctgcctcttt gcacccaagt tgcacatcat 
2451 ccttttccag ccacagaaga atgtggtgag ccaccgggca cctaccagcc 
2501 gctttggcag cgctgccccc agggccagcg ccaaccttgg tcaagggtct 
2551 ggatcccagt ttgttcccac tgtttgcaac ggccgtgagg tggtggactc 
2601 aacaacgtcg tcgctttga 
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Hxiinan mGliiR3 sequences 
Sea ZD 33: D744N 

5 10 15 

Met Lys Met Leu Thr Arg Leu Gin Val Leu Thr Leu Ala Leu Phe Ser Lys Gly Phe 
Leu 

25 30 35 

40 

Leu Ser Leu Gly Asp His Asn Phe Leu Arg Arg Glu lie Lys lie Glu Gly Asp Leu 
Val 

45 50 55 

60 

Leu Gly Gly Leu Phe Pro lie Asn Glu Lys Gly Thr Gly Thr Glu Glu Cys Gly Arg 
lie 

65 70 75 

80 

Asn Glu Asp Arg Gly lie Gin Arg Leu Glu Ala Met Leu Phe Ala He Asp Glu He 
Asn 

85 90 95 

100 

Lys Asp Asp Tyr Leu Leu Pro Gly Val Lys Leu Gly Val His He Leu Asp Thr Cys 
Ser 

105 110 115 

120 

Arg Asp Thr Tyr Ala Leu Glu Gin Ser Leu Glu Phe Val Arg Ala Ser Leu Thr Lys 
Val 

125 130 135 

140 

Asp Glu Ala Glu Tyr Met Cys Pro Asp Gly Ser Tyr Ala He Gin Glu Asn He Pro 
Leu 

145 150 155 

160 

Leu He Ala Gly Val He Gly Gly Ser Tyr Ser Ser Val Ser He Gin Val Ala Asn 
Leu 

165 170 175 

180 

Leu Arg Leu Phe Gin He Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser 
Asp 

185 190 195 

200 

Lys Ser Arg Tyr Asp Tyr Phe Ala Arg Thr Val Pro Pro Asp Phe Tyr Gin Ala Lys 
Ala 

205 210 215 

220 

Met Ala Glu He Leu Arg Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu 
Gly 

225 230 235 

240 

Asp Tyr Gly Glu Thr Gly He Glu Ala Phe Glu Gin Glu Ala Arg Leu Arg Asn He 
Cys 
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245 250 255 

260 

He Ala Thr Ala Glu Lys Val Gly Arg Ser Asn He Arg Lys Ser Tyr Asp Ser Val 
He 

265 270 275 

280 

Arg Glu Leu Leu Gin Lys Pro Asn Ala Arg Val Val Val Leu Phe Met Arg Ser Asp 
Asp 

285 290 295 

300 

Ser Arg Glu Leu He Ala Ala Ala Ser Arg Ala Asn Ala Ser Phe Thr Trp Val Ala 
Ser 

305 310 315 

320 

Asp Gly Trp Gly Ala Gin Glu Ser He He Lys Gly Ser Glu His Val Ala Tyr Gly 
Asp 

325 330 335 

340 

He Thr Leu Glu Leu Ala Ser Gin Pro Val Arg Gin Phe Gly Arg Tyr Phe Gin Ser 
Leu 

345 350 355 

360 

Asn Pro Tyr Asn Asn His Arg Asn Pro Trp Phe Arg Asp Phe Trp Glu Gin Lys Phe 
Gin 

365 370 375 

380 

Cys Ser Leu Gin Asn Lys Arg Asn His Arg Arg Val Cys Glu Lys His Leu Ala He 
Asp 

385 390 395 

400 

Ser Ser Asn Tyr Glu Gin Glu Ser Lys He Met Phe Val Val Asn Ala Val Tyr Ala 
Met 

405 410 415 

420 

Ala His Ala Leu His Lys Met Gin Arg Thr Leu Cys Pro Asn Thr Thr Lys Leu Cys 
Asp 

425 430 435 

440 

Ala Met Lys He Leu Asp Gly Lys Lys Leu Tyr Lys Asp Tyr Leu Leu Lys He Asn 
Phe 

445 450 455 

460 

Thr Ala Pro Phe Asn Pro Asn Lys Asp Ala Asp Ser He Val Lys Phe Asp Thr Phe 
Gly 

465 470 475 

480 

Asp Gly Met Gly Arg Tyr Asn Val Phe Asn Phe Gin Asn Val Gly Gly Lys Tyr Ser 
Tyr 
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485 490 495 

500 

Leu Lys Val Gly His Trp Ala Glu Thr Leu Ser Leu Asp Val Asn Ser lie His Trp 
Ser 

505 510 515 

520 

Arg Asn Ser Val Pro Thr Ser Gin Cys Ser Asp Pro Cys Ala Pro Asn Glu Met Lys 
Asn 

525 530 535 

540 

Met Gin Pro Gly Asp Val Cys Cys Trp He Cys He Pro Cys Glu Pro Tyr Glu Tyr 
Leu 

545 550 555 

560 

Ala Asp Glu Phe Thr Cys Met Asp Cys Gly Ser Gly Gin Trp Pro Thr Ala Asp Leu 
Thr 

565 570 575 

580 

Gly Cys Tyr Asp Leu Pro Glu Asp Tyr He Arg Trp Glu Asp Ala Trp Ala He Gly 
Pro 

585 590 595 

600 

Val Thr He Ala Cys Leu Gly Phe Met Cys Thr Cys Met Val Val Thr Val Phe He 
Lys 

605 610 615 

620 

His Asn Asn Thr Pro Leu Val Lys Ala Ser Gly Arg Glu Leu Cys 'Tyr He Leu Leu 
Phe 

625 630 635 

640 

Gly Val Gly Leu Ser Tyr Cys Met Thr Phe* Phe Phe He Ala Lys Pro Ser Pro Val 
He 

645 650 655 

660 

Cys Ala Leu Arg Arg Leu Gly Leu Gly Ser Ser Phe Ala He Cys Tyr Ser Ala Leu 
Leu 

665 670 675 

680 

Thr Lys Thr Asn Cys He Ala Arg He Phe Asp Gly Val Lys Asn Gly Ala Gin Arg 
Pro 

685 690 695 

700 

Lys Phe He Ser Pro Ser Ser Gin Val Phe He Cys Leu Gly Leu He Leu Val Gin 
He 

705 710 715 

720 

Val Met Val Ser Val Trp Leu He Leu Glu Ala Pro Gly Thr Arg Arg Tyr Thr Leu 
Ala 
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725 730 735 

740 

Glu Lys Arg Glu Thr Val lie Leu Lys Cys Asn Val Lys Asp Ser Ser Met Leu lie 
Ser 

745 750 755 

760 

Leu Thr Tyr Asn Val lie Leu Val He Leu Cys Thr Val Tyr Ala Phe Lys Thr Arg 
Lys 

765 770 775 

780 

Cys Pro Glu Asn Phe Asn Glu Ala Lys Phe He Gly Phe Thr Met Tyr Thr Thr Cys 
He 

785 790 795 

800 

He Tip Leu Ala Phe Leu Pro He Phe Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin 
Thr 

805 810 815 

820 

Thr Thr Met Cys He Ser Val Ser Leu Ser Gly Phe Val Val Leu Gly Cys Leu Phe 
Ala 

825 830 835 

840 

Pro Lys Val His He He Leu Phe Gin Pro Gin Lys Asn Val Val Thr His Arg Leu 
His 

845 850 855 

860 

Leu Asn Arg Phe Ser Val Ser Gly Thr Gly Thr Thr Tyr Ser Gin Ser Ser Ala Ser 
Thr 

865 870 875 

880 

Tyr Val Pro Thr Val Cys Asn Gly Arg Glu Val Leu Asp Ser Thr Thr Ser Ser Leu 
Ter 

Seq ID 34: V6986, D744N 

5 10 15 

Met Lys Met Leu Thr Arg Leu Gin Val Leu Thr Leu Ala Leu Phe Ser Lys Gly Phe 
Leu 

25 30 35 

40 

Leu Ser Leu Gly Asp His Asn Phe Leu Arg Arg Glu He Lys He Glu Gly Asp Leu 
Val 

45 50 55 

60 

Leu Gly Gly Leu Phe Pro He Asn Glu Lys Gly Thr Gly Thr Glu Glu Cys Gly Arg 
He 

65 70 75 

80 

Asn Glu Asp Arg Gly He Gin Arg Leu Glu Ala Met Leu Phe Ala He Asp Glu He 
Asn 
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85 90 95 

100 

Lys Asp Asp Tyr Leu Leu Pro Gly Val Lys Leu Gly Val His He Leu Asp Thr Cys 
Ser 

105 110 115 

120 

Arg Asp Thr Tyr Ala Leu Glu Gin Ser Leu Glu Phe Val Arg Ala Ser Leu Thr Lys 
Val 

125 130 135 

140 

Asp Glu Ala Glu Tyr Met Cys Pro Asp Gly Ser Tyr Ala He Gin Glu Asn He Pro 
Leu 

145 150 155 

160 

Leu He Ala Gly Val He Gly Gly Ser Tyr Ser Ser Val Ser He Gin Val Ala Asn 
Leu 

165 170 175 

180 

Leu Arg Leu Phe Gin He Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser 

Asp 

185 190 195 

200 

Lys Ser Arg Tyr Asp Tyr Phe Ala Arg Thr Val Pro Pro Asp Phe Tyr Gin Ala Lys 
Ala 

205 210 215 

220 

Met Ala Glu He Leu Arg Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu 
Gly 



225 230 235 

240 

Asp Tyr Gly Glu Thr Gly He Glu Ala Phe Glu Gin Glu Ala Arg Leu Arg Asn He 
Cys 

245 250 255 

260 

He Ala Thr Ala Glu Lys Val Gly Arg Ser Asn He Arg Lys Ser Tyr Asp Ser Val 
He 

265 270 275 

280 

Arg Glu Leu Leu Gin Lys Pro Asn Ala Arg Val Val Val Leu Phe Met Arg Ser Asp 
Asp 

285 290 295 

300 

Ser Arg Glu Leu He Ala Ala Ala Ser Arg Ala Asn Ala Ser Phe Thr Trp Val Ala 
Ser 

305 310 315 

320 

Asp Gly Trp Gly Ala Gin Glu Ser He He Lys Gly Ser Glu His Val Ala Tyr Gly 
Asp 
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325 330 335 

340 

He Thr Leu Glu Leu Ala Ser Gin Pro Val Arg Gin Phe Gly Arg Tyr Phe Gin Ser 
Leu 

345 350 355 

360 

Asn Pro Tyr Asn Asn His Arg Asn Pro Trp Phe Arg Asp Phe Trp Glu Gin Lys Phe 
Gin 

365 370 375 

380 

Cys Ser Leu Gin Asn Lys Arg Asn His Arg Arg Val Cys Glu Lys His Leu Ala He 
Asp 

385 390 395 

400 

Ser Ser Asn Tyr Glu Gin Glu Ser Lys He Met Phe Val Val Asn Ala Val Tyr Ala 
Met 

405 410 415 

420 

Ala His Ala Leu His Lys Met Gin Arg Thr Leu Cys Pro Asn Thr Thr Lys Leu Cys 
Asp 

425 430 435 

440 

Ala Met Lys He Leu Asp Gly Lys Lys Leu Tyr Lys Asp Tyr Leu Leu Lys He Asn 
Phe 

445 450 455 

460 

Thr Ala Pro Phe Asn Pro Asn Lys Asp Ala Asp Ser He Val Lys Phe Asp Thr Phe 
Gly 

465 470 475 

480 

Asp Gly Met Gly Arg Tyr Asn Val Phe Asn Phe Gin Asn Val Gly Gly Lys Tyr Ser 
Tyr 

485 490 495 

500 

Leu Lys Val Gly His Trp Ala Glu Thr Leu Ser Leu Asp Val Asn Ser He His Trp 
Ser 

505 510 515 

520 

Arg Asn Ser Val Pro Thr Ser Gin Cys Ser Asp Pro Cys Ala Pro Asn Glu Met Lys 
Asn 

525 530 535 

540 

Met Gin Pro Gly Asp Val Cys Cys Trp He Cys He Pro Cys Glu Pro Tyr Glu Tyr 
Leu 

545 550 555 

560 

Ala Asp Glu Phe Thr Cys Met Asp Cys Gly Ser Gly Gin Trp Pro Thr Ala Asp Leu 
Thr 
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565 570 575 

580 

Gly Cys Tyr Asp Leu Pro Glu Asp Tyr lie Arg Trp Glu Asp Ala Trp Ala He Gly 
Pro 

585 ^ 590 595 

600 

Val Thr He Ala Cys Leu Gly Phe Met Cys Thr Cys Met Val Val Thr Val Phe He 
Lys 

605 610 • 615 

620 

His Asn Asn Thr Pro Leu Val Lys Ala Ser Gly Arg Glu Leu Cys Tyr He Leu Leu 
Phe 

625 630 635 

640 

Gly Val Gly Leu Ser Tyr Cys Met Thr Phe Phe Phe He Ala Lys Pro Ser Pro Val 
He 

645 650 655 

660 

Cys Ala Leu Arg Arg Leu Gly Leu Gly Ser Ser Phe Ala He Cys Tyr Ser Ala Leu 
Leu 

665 670 675 

680 

Thr Lys Thr Asn Cys He Ala Arg He Phe Asp Gly Val Lys Asn Gly Ala Gin Arg 
Pro 

685 690 695 

700 

Lys Phe He Ser Pro Ser Ser Gin Val Phe He Cys Leu Gly Leu He Leu Gly Gin 
He 

705 710 715 

720 

Val Met Val Ser Val Trp Leu He Leu Glu Ala Pro Gly Thr Arg Arg Tyr Thr Leu 
Ala 

725 730 735 

740 

Glu Lys Arg Glu Thr Val He Leu Lys Cys Asn Val Lys Asp Ser Ser Met Leu He 
Ser 

745 750 755 

760 

Leu Thr Tyr Asn Val He Leu Val He Leu Cys Thr Val Tyr Ala Phe Lys Thr Arg 
Lys 

765 770 775 

780 

Cys Pro Glu Asn Phe Asn Glu Ala Lys Phe He Gly Phe Thr Met Tyr Thr Thr Cys 
He 

785 790 795 

800 

He Trp Leu Ala Phe Leu Pro He Phe Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin 
Thr 
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805 810 815 

820 

Thr Thr Met Cys lie Ser Val Ser Leu Ser Gly Phe Val Val Leu Gly Cys Leu Piie 
Ala 

825 830 835 

840 

Pro Lys Val His lie He Leu Phe Gin Pro Gin Lys Asn Val Val Thr His Arg Leu 
His 

845 850 855 

860 

Leu Asn Arg Phe Ser Val Ser Gly Thr Gly Thr Thr Tyr Ser Gin Ser Ser Ala Ser 
Thr 

865 870 875 

880 

Tyr Val Pro Thr Val Cys Asn Gly Arg Glu Val Leu Asp Ser Thr Thr Ser Ser Leu 
Ter 

Se<I ID 35 s I.697S, V698G, D744N 

5 10 15 

Met Lys Met Leu Thr Arg Leu Gin Val Leu Thr Leu Ala Leu Phe Ser Lys Gly Phe 
Leu 

25 30 35 

40 

Leu Ser Leu Gly Asp His Asn Phe Leu Arg Arg Glu He Lys He Glu Gly Asp Leu 
Val 

45 50 55 

60 

Leu Gly Gly Leu Phe Pro He Asn Glu Lys Gly Thr Gly Thr Glu Glu Cys Gly Arg 
He 

65 70 75 

80 

Asn Glu Asp Arg Gly He Gin Arg Leu Glu Ala Met Leu Phe Ala He Asp Glu He 
Asn 

85 90 95 

100 

Lys Asp Asp Tyr Leu Leu Pro Gly Val Lys Leu Gly Val His He Leu Asp Thr Cys 
Ser 

105 110 115 

120 

Arg Asp Thr Tyr Ala Leu Glu Gin Ser Leu Glu Phe Val Arg Ala Ser Leu Thr Lys 
Val 

125 130 135 

140 

Asp Glu Ala Glu Tyr Met Cys Pro Asp Gly Ser Tyr Ala He Gin Glu Asn He Pro 
Leu 

145 150 155 

160 
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Leu lie Ala Gly Val lie Gly Gly Ser Tyr Ser Ser Val Ser lie Gin Val Ala Asn 
Leu 

165 170 175 

180 

Leu Arg Leu Phe Gin He Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser 
Asp 

185 190 195 

200 

Lys Ser Arg Tyr Asp Tyr Phe Ala Arg Thr Val Pro Pro Asp Phe Tyr Gin Ala Lys 
Ala 

205 210 215 

220 

Met Ala Glu He Leu Arg Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu 
Gly 

225 230 235 

240 

Asp Tyr Gly Glu Thr Gly He Glu Ala Phe Glu Gin Glu Ala Arg Leu Arg Asn He 
Cys 

245 250 255 

260 

He Ala Thr Ala Glu Lys Val Gly Arg Ser Asn He Arg Lys Ser Tyr Asp Ser Val 
He 

265 270 275 

280 

Arg Glu Leu Leu Gin Lys Pro Asn Ala Arg Val Val Val Leu Phe Met Arg Ser Asp 
Asp 

285 290 295 

300 

Ser Arg Glu Leu He Ala Ala Ala Ser Arg Ala Asn Ala Ser Phe Thr Trp Val Ala 
Ser 

305 310 315 

320 

Asp Gly Trp Gly Ala Gin Glu Ser He He Lys Gly Ser Glu His Val Ala Tyr Gly 
Asp 

325 330 335 

340 

He Thr Leu Glu Leu Ala Ser Gin Pro Val Arg Gin Phe Gly Arg Tyr Phe Gin Ser 

Leu 

345 350 355 

360 

Asn Pro Tyr Asn Asn His Arg Asn Pro Trp Phe Arg Asp Phe Trp Glu Gin Lys Phe 
Gin 

365 370 375 

380 

Cys Ser Leu Gin Asn Lys Arg Asn His Arg Arg Val Cys Glu Lys His Leu Ala He 
Asp 

385 390 395 

400 
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Ser Ser Asn Tyr Glu Gin Glu Ser Lys He Met Phe Val Val Asn Ala Val Tyr Ala 
Met 

405 410 415 

420 

Ala His Ala Leu His Lys Met Gin Arg Thr Leu Cys Pro Asn Thr Thr Lys Leu Cys 
Asp 

425 430 435 

440 

Ala Met Lys He Leu Asp Gly Lys Lys Leu Tyr Lys Asp Tyr Leu Leu Lys He Asn 
Phe 

445 450 455 

460 

Thr Ala Pro Phe Asn Pro Asn Lys Asp Ala Asp Ser He Val Lys Phe Asp Thr Phe 
Gly 

465 470 475 

480 

Asp Gly Met Gly Arg Tyr Asn Val Phe Asn Phe Gin Asn Val Gly Gly Lys Tyr Ser 
Tyr 

485 490 495 

500 

Leu Lys Val Gly His Trp Ala Glu Thr Leu Ser Leu Asp Val Asn Ser He His Trp 
Ser 

505 510 515 

520 

Arg Asn Ser Val Pro Thr Ser Gin Cys Ser Asp Pro Cys Ala Pro Asn Glu Met Lys 
Asn 

525 530 535 

540 

Met Gin Pro Gly Asp Val Cys Cys Trp He Cys He Pro Cys Glu Pro Tyr Glu Tyr 
Leu 

545 550 555 

560 

Ala Asp Glu Phe Thr Cys Met Asp Cys Gly Ser Gly Gin Trp Pro Thr Ala Asp Leu 
Thr 

565 570 575 

580 

Gly Cys Tyr Asp Leu Pro Glu Asp Tyr He Arg Tirp Glu Asp Ala Trp Ala He Gly 
Pro 

585 590 595 

600 

Val Thr He Ala Cys Leu Gly Phe Met Cys Thr Cys Met Val Val Thr Val Phe He 
Lys 

605 610 615 

620 

His Asn Asn Thr Pro Leu Val Lys Ala Ser Gly Arg Glu Leu Cys Tyr He Leu Leu 
Phe 

625 630 635 

640 
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Gly Val Gly Leu Ser Tyr Cys Met Thr Phe Phe Phe lie Ala Lys Pro Ser Pro Val 
lie 

645 650 655 

660 

Cys Ala Leu Arg Arg Leu Gly Leu Gly Ser Ser Phe Ala He Cys Tyr Ser Ala Leu 
Leu 

665 670 675 

680 

Thr Lys Thr Asn Cys He Ala Arg He Phe Asp Gly Val Lys Asn Gly Ala Gin Arg 
Pro 

685 690 695 

700 

Lys Phe He Ser Pro Ser Ser Gin Val Phe He Cys Leu Gly Leu He Ser Gly Gin 
He 

705 710 715 

720 

Val Met Val Ser Val Trp Leu He Leu Glu Ala Pro Gly Thr Arg Arg Tyr Thr Leu 
Ala 

725 730 735 

740 

Glu Lys Arg Glu Thr Val He Leu Lys Cys Asn Val Lys Asp Ser Ser Met Leu He 
Ser 

745 750 755 

760 

Leu Thr Tyr Asn Val He Leu Val He Leu Cys Thr Val Tyr Ala Phe Lys Thr Arg 
Lys 

765 770 775 

780 

Cys Pro Glu Asn Phe Asn Glu Ala Lys Phe He Gly Phe Thr Met Tyr Thr Thr Cys 
He 

785 790 795 

800 

He Trp Leu Ala Phe Leu Pro He Phe Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin 
Thr 

805 810 815 

820 

Thr Thr Met Cys He Ser Val Ser Leu Ser Gly Phe Val Val Leu Gly Cys Leu Phe 
Ala 

825 830 835 

840 

Pro Lys Val His He He Leu Phe Gin Pro Gin Lys Asn Val Val Thr His Arg Leu 
His 

845 850 855 

860 

Leu Asn Arg Phe Ser Val Ser Gly Thr Gly Thr Thr Tyr Ser Gin Ser Ser Ala Ser 
Thr 

865 870 875 

880 
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Tyr Val Pro Thr Val Cys Asn Gly Arg Glu Val Leu Asp Ser Thr Thr Ser Ser Leu 
Ter 

Se<2 XD 36: Ii697S 

5 10 15 

Met Lys Met Leu Thr Arg Leu Gin Val Leu Thr Leu Ala Leu Phe Ser Lys Gly Phe 
Leu 

25 30 35 

40 

Leu Ser Leu Gly Asp His Asn Phe Leu Arg Arg Glu lie Lys lie Glu Gly Asp Leu 
Val 

45 50 55 

60 

Leu Gly Gly Leu Phe Pro lie Asn Glu Lys Gly Thr Gly Thr Glu Glu Cys Gly Arg 
He 

65 70 75 

80 

Asn Glu Asp Arg Gly He Gin Arg Leu Glu Ala Met Leu Phe Ala He Asp Glu He 
Asn 

85 90 95 

100 

Lys Asp Asp Tyr Leu Leu Pro Gly Val Lys Leu Gly Val His He Leu Asp Thr Cys 
Ser 

105 110 115 

120 

Arg Asp Thr Tyr Ala Leu Glu Gin Ser Leu Glu Phe Val Arg Ala Ser Leu Thr Lys 
Val 

125 130 135 

140 

Asp Glu Ala Glu Tyr Met Cys Pro Asp Gly Ser Tyr Ala He Gin Glu Asn He Pro 
Leu 

145 150 155 

160 

Leu He Ala Gly Val He Gly Gly Ser Tyr Ser Ser Val Ser He Gin Val Ala Asn 
Leu 

165 170 175 

180 

Leu Arg Leu Phe Gin He Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser 
Asp 

185 190 195 

200 

Lys Ser Arg Tyr Asp Tyr Phe Ala Arg Thr Val Pro Pro Asp Phe Tyr Gin Ala Lys 
Ala 

205 210 215 

220 

Met Ala Glu He Leu Arg Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu 
Gly 
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225 



230 



235 



240 



Asp Tyr Gly Glu Thr Gly He Glu Ala Phe Glu Gin Glu Ala Arg Leu Arg Asn He 
Cys 



245 



250 



255 



260 



He Ala Thr Ala Glu I#ys Val Gly Arg Ser Asn He Arg Lys Ser Tyr Asp Ser Val 
He 



265 



270 



275 



280 

Arg Glu Leu Leu Gin Lys Pro Asn Ala Arg Val Val Val Leu Phe Met Arg Ser Asp 
Asp 



285 



290 



295 



300 



Ser Arg Glu Leu He Ala Ala Ala Ser Arg Ala Asn Ala Ser Phe Thr Trp Val Ala 
Ser 



305 



310 



315 



320 



Asp Gly Trp Gly Ala Gin Glu Ser He He Lys Gly Ser Glu His Val Ala Tyr Gly 
Asp 



325 330 335 

340 

He Thr Leu Glu Leu Ala Ser Gin Pro Val Arg Gin Phe Gly Arg Tyr Phe Gin Ser 
Leu 

345 350 355 

360 

Asn Pro Tyr Asn Asn His Arg Asn Pro Trp Phe Arg Asp Phe Trp Glu Gin Lys Phe 
Gin 

365 370 375 

380 

Cys Ser Leu Gin Asn Lys Arg Asn His Arg Arg Val Cys Glu Lys His Leu Ala He 
Asp 

385 390 395 

400 

Ser Ser Asn Tyr Glu Gin Glu Ser Lys He Met Phe Val Val Asn Ala Val Tyr Ala 
Met 

405 410 415 

420 

Ala His Ala Leu His Lys Met Gin Arg Thr Leu Cys Pro Asn Thr Thr Lys Leu Cys 
Asp 

425 430 435 

440 

Ala Met Lys He Leu Asp Gly Lys Lys Leu Tyr Lys Asp Tyr Leu Leu Lys He Asn 
Phe 

445 450 455 

460 

Thr Ala Pro Phe Asn Pro Asn Lys Asp Ala Asp Ser He Val Lys Phe Asp Thr Phe 
Gly 
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465 470 475 

480 

Asp Gly Met Gly Arg Tyr Asn Val Phe Asn Phe Gin Asn Val Gly Gly Lys Tyr Ser 
Tyr 

485 490 495 

500 

Leu Lys Val Gly His Trp Ala Glu Thr Leu Ser Leu Asp Val Asn Ser lie His Trp 
Ser 

505 510 515 

520 

Arg Asn Ser Val Pro Thr Ser Gin Cys Ser Asp Pro Cys Ala Pro Asn Glu Met Lys 
Asn 

525 530 535 

540 

Met Gin Pro Gly Asp Val Cys Cys Trp He Cys He Pro Cys Glu Pro Tyr Glu Tyr 
Leu 

545 550 555 

560 

Ala Asp Glu Phe Thr Cys Met Asp Cys Gly Ser Gly Gin Trp Pro Thr Ala Asp Leu 
Thr 

565 570 575 

580 

Gly Cys Tyr Asp Leu Pro Glu Asp Tyr He Arg Trp Glu Asp Ala Trp Ala He Gly 
Pro 

585 590 595 

600 

Val Thr He Ala Cys Leu Gly Phe Met Cys Thr Cys Met Val Val Thr Val Phe He 
Lys 

605 610 615 

620 

His Asn Asn Thr Pro Leu Val Lys Ala Ser Gly Arg Glu Leu Cys Tyr He Leu Leu 
Phe 

625 630 635 

640 

Gly Val Gly Leu Ser Tyr Cys Met Thr Phe Phe Phe He Ala Lys Pro Ser Pro Val 
He 

645 650 655 

660 

Cys Ala Leu Arg Arg Leu Gly Leu Gly Ser Ser Phe Ala He Cys Tyr Ser Ala Leu 
Leu 

665 670 675 

680 

Thr Lys Thr Asn Cys He Ala Arg He Phe Asp Gly Val Lys Asn Gly Ala Gin Arg 
Pro 

685 690 695 

700 

Lys Phe He Ser Pro Ser Ser Gin Val Phe He Cys Leu Gly Leu He Ser Val Gin 
He 
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705 710 715 

720 

Val Met Val Ser Val Trp Leu He Leu Glu Ala Pro Gly Thr Arg Arg Tyr Thr Leu 
Ala 

725 730 735 

740 

Glu Lys Arg Glu Thr Val He Leu Lys Cys Asn Val Lys Asp Ser Ser Met Leu He 
Ser 

745 750 755 

760 

Leu Thr Tyr Asp Val He Leu Val He Leu Cys Thr Val Tyr Ala Phe Lys Thr Arg 
Lys 

765 770 775 

780 

Cys Pro Glu Asn Phe Asn Glu Ala Lys Phe He Gly Phe Thr Met Tyr Thr Thr Cys 
He 

785 790 795 

800 

He Trp Leu Ala Phe Leu Pro He Phe Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin 
Thr 

805 810 815 

820 

Thr Thr Met Cys He Ser Val Ser Leu Ser Gly Phe Val Val Leu Gly Cys Leu Phe 
Ala 

825 830 835 

840 

Pro Lys Val His He He Leu Phe Gin Pro Gin Lys Asn Val Val Thr His Arg Leu 
His 

845 850 855 

860 

Leu Asn Arg Phe Ser Val Ser Gly Thr Gly Thr Thr Tyr Ser Gin Ser Ser Ala Ser 
Thr 

865 870 875 

880 

Tyr Val Pro Thr Val Cys Asn Gly Arg Glu Val Leu Asp Ser Thr Thr Ser Ser Leu 
Ter 

Seq XD 37: Zf697S, V698G 

5 10 15 

Met Lys Met Leu Thr Arg Leu Gin Val Leu Thr Leu Ala Leu Phe Ser Lys Gly Phe 
Leu 

25 30 35 

40 

Leu Ser Leu Gly Asp His Asn Phe Leu Arg Arg Glu He Lys He Glu Gly Asp Leu 
Val 

45 50 55 

60 

Leu Gly Gly Leu Phe Pro He Asn Glu Lys Gly Thr Gly Thr Glu Glu Cys Gly Arg 
He 
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65 70 75 

80 

Asn Glu Asp Arg Gly lie Gin Arg Leu Glu Ala Met Leu Phe Ala Xle Asp Glu lie 
Asn 

85 90 95 

100 

Lys Asp Asp Tyr Leu Leu Pro Gly Val Lys Leu Gly Val His lie Leu Asp Thr Cys 
Ser 

105 110 115 

120 

Arg Asp Thr Tyr Ala Leu Glu Gin Ser Leu Glu Phe Val Arg Ala Ser Leu Thr Lys 
Val 

125 130 135 

140 

Asp Glu Ala Glu Tyr Met Cys Pro Asp Gly Ser Tyr Ala lie Gin Glu Asn lie Pro 
Leu 

145 150 155 

160 

Leu lie Ala Gly Val lie Gly Gly Ser Tyr Ser Ser Val Ser lie Gin Val Ala Asn 
Leu 

165 170 175 

180 

Leu Arg Leu Phe Gin He Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser 
Asp 



185 190 195 

200 

Lys Ser Arg Tyr Asp Tyr Phe Ala Arg Thr Val Pro Pro Asp Phe Tyr Gin Ala Lys 
Ala 

205 210 215 

220 

Met Ala Glu He Leu Arg Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu 
Gly 

225 230 235 

240 

Asp Tyr Gly Glu Thr Gly He Glu Ala Phe Glu Gin Glu Ala Arg Leu Arg Asn He 
Cys 

245 250 255 

260 

He Ala Thr Ala Glu Lys Val Gly Arg Ser Asn He Arg Lys Ser Tyr Asp Ser Val 
He 

265 270 275 

280 

Arg Glu Leu Leu Gin Lys Pro Asn Ala Arg Val Val Val Leu Phe Met Arg Ser Asp 
Asp 

285 290 295 

300 

Ser Arg Glu Leu He Ala Ala Ala Ser Arg Ala Asn Ala Ser Phe Thr Trp Val Ala 
Ser 
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305 310 315 

320 

Asp Gly Trp Gly Ala Gin Glu Ser lie lie Lys Gly Ser Glu His Val Ala Tyr Gly 
Asp 

325 330 335 

340 

lie Thr Leu Glu Leu Ala Ser Gin Pro Val Arg Gin Phe Gly Arg Tyr Phe Gin Ser 
Leu 

345 350 355 

360 

Asn Pro Tyr Asn Asn His Arg Asn Pro Trp Phe 2^g Asp Phe Trp Glu Gin Lys Phe 
Gin 

365 370 375 

380 

Cys Ser Leu Gin Asn Lys Arg Asn His Arg Arg Val Cys Glu Lys His Leu Ala lie 
Asp 

385 390 395 

400 

Ser Ser Asn Tyr Glu Gin Glu Ser Lys lie Met Phe Val Val Asn Ala Val Tyr Ala 

Met 

405 410 415 

420 

Ala His Ala Leu His Lys Met Gin Arg Thr Leu Cys Pro Asn Thr Thr Lys Leu Cys 
Asp 

425 430 435 

440 

Ala Met Lys lie Leu Asp Gly Lys Lys Leu Tyr Lys Asp Tyr Leu Leu Lys He Asn 
Phe 

445 450 455 

460 

Thr Ala Pro Phe Asn Pro Asn Lys Asp Ala Asp Ser He Val Lys Phe Asp Thr Phe 

Gly 

465 470 475 

480 

Asp Gly Met Gly Arg Tyr Asn Val Phe Asn Phe Gin Asn Val Gly Gly Lys Tyr Ser 
Tyr 

485 490 * 495 

500 

Leu Lys Val Gly His Trp Ala Glu Thr Leu Ser Leu Asp Val Asn Ser He His Trp 
Ser 

505 510 515 

520 

Arg Asn Ser Val Pro Thr Ser Gin Cys Ser Asp Pro Cys Ala Pro Asn Glu Met Lys 
Asn 

525 530 535 

540 

Met Gin Pro Gly Asp Val Cys Cys Trp He Cys He Pro Cys Glu Pro Tyr Glu Tyr 
Leu 
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545 550 555 

560 

Ala Asp Glu Phe Thr Cys Met Asp Cys Gly Ser Gly Gin Trp Pro Thr Ala Asp Leu 
Thr 

565 570 575 

580 

Gly Cys Tyr Asp Leu Pro Glu Asp Tyr He Arg Trp Glu Asp Ala Trp Ala He Gly 
Pro 

585 590 595 

600 

Val Thr He Ala Cys Leu Gly Phe Met Cys Thr Cys Met Val Val Thr Val Phe He 
Lys 

605 610 615 

620 

His Asn Asn Thr Pro Leu Val Lys Ala Ser Gly Arg Glu Leu Cys Tyr He Leu Leu 
Phe 

625 630 635 

640 

Gly Val Gly Leu Ser Tyr Cys Met Thr Phe Phe Phe He Ala Lys Pro Ser Pro Val 
He 

645 650 655 

660 

Cys Ala Leu Arg Arg Leu Gly Leu Gly Ser Ser Phe Ala He Cys Tyr Ser Ala Leu 
Leu 

665 670 675 

680 

Thr Lys Thr Asn Cys He Ala Arg He Phe Asp Gly Val Lys Asn Gly Ala Gin Arg 
Pro 

685 690 695 

700 

Lys Phe He Ser Pro Ser Ser Gin Val Phe He Cys Leu Gly Leu He Ser Gly Gin 
He 

705 710 715 

720 

Val Met Val Ser Val Trp Leu He Leu Glu Ala Pro Gly Thr Arg Arg Tyr Thr Leu 
Ala 

725 730 735 

740 

Glu Lys Arg Glu Thr Val He Leu Lys Cys Asn Val Lys Asp Ser Ser Met Leu He 
Ser 

745 750 755 

760 

Leu Thr Tyr Asp Val He Leu Val He Leu Cys Thr Val Tyr Ala Phe Lys Thr Arg 
Lys 

765 770 775 

780 

Cys Pro Glu Asn Phe Asn Glu Ala Lys Phe He Gly Phe Thr Met Tyr Thr Thr Cys 
He 
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785 790 795 

800 

lie Trp Leu Ala Phe Leu Pro He Phe Tyr Val Tlir Ser Ser Asp Tyr Arg Val Gin 
Thr 

805 810 815 

820 

Thr Thr Met Cys He Ser Val Ser Leu Ser Gly Phe Val Val Leu Gly Cys Leu Phe 
Ala 

825 830 835 

840 

Pro Lys Val His He He Leu Phe Gin Pro Gin Lys Asn Val Val Thr His Arg Leu 
His 

845 850 855 

860 

Leu Asn Arg Phe Ser Val Ser Gly Thr Gly Thr Thr Tyr Ser Gin Ser Ser Ala Ser 
Thr 

865 870 875 

880 

Tyr Val Pro Thr Val Cys Asn Gly Arg Glu Val Leu Asp Ser Thr Thr Ser Ser Leu 
Ter 

Seq ID 38: I.697S, D744N 

5 10 15 

Met Lys Met Leu Thr Arg Leu Gin Val Leu Thr Leu Ala Leu Phe Ser Lys Gly Phe 
Leu 

25 30 35 

40 

Leu Ser Leu Gly Asp His Asn Phe Leu Arg Arg Glu He Lys He Glu Gly Asp Leu 
Val 

45 50 55 

60 

Leu Gly Gly Leu Phe Pro He Asn Glu Lys Gly Thr Gly Thr Glu Glu Cys Gly Arg 
He 

65 70 75 

80 

Asn Glu Asp Arg Gly He Gin Arg Leu Glu Ala Met Leu Phe Ala He Asp Glu He 
Asn 

85 90 95 

100 

Lys Asp Asp Tyr Leu Leu Pro Gly Val Lys Leu Gly Val His He Leu Asp Thr Cys 
Ser 

105 110 115 

120 

Arg Asp Thr Tyr Ala Leu Glu Gin Ser Leu Glu Phe Val Arg Ala Ser Leu Thr Lys 
Val 

125 130 135 

140 
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Asp Glu Ala Glu Tyr Met Cys Pro Asp Gly Ser Tyr Ala lie Gin Glu Asn lie Pro 
Leu 

145 150 155 

160 

Leu lie Ala Gly Val He Gly Gly Ser Tyr Ser Ser Val Ser He Gin Val Ala Asn 
Leu 

165 170 175 

180 

Leu Arg Leu Phe Gin He Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser 
Asp 

185 190 195 

200 

Lys Ser Arg Tyr Asp Tyr Phe Ala Arg Thr Val Pro Pro Asp Phe Tyr Gin Ala Lys 
Ala 

205 210 215 

220 

Met Ala Glu He Leu Arg Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu 
Gly 

225 230 235 

240 

Asp Tyr Gly Glu Thr Gly He Glu Ala Phe Glu Gin Glu Ala Arg Leu Arg Asn He 
Cys 

245 250 255 

260 

He Ala Thr Ala Glu Lys Val Gly Arg Ser Asn He 2^g Lys Ser Tyr Asp Ser Val 
He 

265 270 275 

280 

Arg Glu Leu Leu Gin Lys Pro Asn Ala Arg Val Val Val Leu Phe Met Arg Ser Asp 
Asp 

285 290 295 

300 

Ser Arg Glu Leu He Ala Ala Ala Ser Arg Ala Asn Ala Ser Phe Thr Trp Val Ala 
Ser 

305 310 315 

320 

Asp Gly Trp Gly Ala Gin Glu Ser He He Lys Gly Ser Glu His Val Ala Tyr Gly 
Asp 

325 330 335 

340 

He Thr Leu Glu Leu Ala Ser Gin Pro Val Arg Gin Phe Gly Arg Tyr Phe Gin Ser 
Leu 

345 350 355 

360 

Asn Pro Tyr Asn Asn His Arg Asn Pro Trp Phe Arg Asp Phe Trp Glu Gin Lys Phe 
Gin 

365 370 375 

380 
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Cys Ser Leu Gin Asn Lys Arg Asn His Arg Arg Val Cys Glu Lys His Leu Ala lie 
Asp 

385 390 395 

400 

Ser Ser Asn Tyr Glu Gin Glu Ser Lys lie Met Phe Val Val Asn Ala Val Tyr Ala 
Met 

405 410 415 

420 

Ala His Ala Leu His Lys Met Gin Arg Thr Leu Cys Pro Asn Thr Thr Lys Leu Cys 
Asp 

425 430 435 

440 

Ala Met Lys lie Leu Asp Gly Lys Lys Leu Tyr Lys Asp Tyr Leu Leu Lys lie Asn 
Phe 

445 450 455 

460 

Thr Ala Pro Phe Asn Pro Asn Lys Asp Ala Asp Ser lie Val Lys Phe Asp Thr Phe 
Gly 

465 470 475 

480 

Asp Gly Met Gly Arg Tyr Asn Val Phe Asn Phe Gin Asn Val Gly Gly Lys Tyr Ser 
Tyr 

485 490 495 

500 

Leu Lys Val Gly His Trp Ala Glu Thr Leu Ser Leu Asp Val Asn Ser lie His Trp 
Ser 

505 510 515 

520 

Arg Asn Ser Val Pro Thr Ser Gin Cys Ser Asp Pro Cys Ala Pro Asn Glu Met Lys 
Asn 

525 530 535 

540 

Met Gin Pro Gly Asp Val Cys Cys Trp He Cys He Pro Cys Glu Pro Tyr Glu Tyr 
Leu 

545 550 555 

560 

Ala Asp Glu Phe Thr Cys Met Asp Cys Gly Ser Gly Gin Trp Pro Thr Ala Asp Leu 
Thr 

565 570 575 

580 

Gly Cys Tyr Asp Leu Pro Glu Asp Tyr He Arg Trp Glu Asp Ala Trp Ala He Gly 
Pro 

585 590 595 

600 

Val Thr He Ala Cys Leu Gly Phe Met Cys Thr Cy^ Met Val Val Thr Val Phe He 
Lys 

605 610 615 

620 
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His Asn Asn Ttir Pro Leu Val Lys Ala Ser Gly Arg Glu Leu Cys Tyr lie Leu Leu 
Phe 

625 630 635 

640 

Gly Val Gly Leu Ser Tyr Cys Met Thr Phe Phe Phe lie Ala Lys Pro Ser Pro Val 
lie 

645 650 655 

660 

Cys Ala Leu Arg Arg Leu Gly Leu Gly Ser Ser Phe Ala lie Cys Tyr Ser Ala Leu 
Leu 

665 670 675 

680 

Thr Lys Thr Asn Cys He. Ala Arg He Phe Asp Gly Val Lys Asn Gly Ala Gin Arg 
Pro 

685 690 695 

700 

Lys Phe He Ser Pro Ser Ser Gin Val Phe He Cys Leu Gly Leu He Ser Val Gin 
He 

705 710 715 

720 

Val Met Val Ser Val Trp Leu He Leu Glu Ala Pro Gly Thr Arg Arg Tyr Thr Leu 
Ala 

725 730 735 

740 

Glu Lys Arg Glu Thr Val He Leu Lys Cys Asn Val Lys Asp Ser Ser Met Leu He 
Ser 

745 750 755 

760 

Leu Thr Tyr Asn Val He Leu Val He Leu Cys Thr Val Tyr Ala Phe Lys Thr Arg 
Lys 

765 770 775 

780 

Cys Pro Glu Asn Phe Asn Glu Ala Lys Phe He Gly Phe Thr Met Tyr Thr Thr Cys 
He 

785 790 795 

800 

He Trp Leu Ala Phe Leu Pro He Phe Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin 
Thr 

805 810 815 

820 

Thr Thr Met Cys He Ser Val Ser Leu Ser Gly Phe Val Val Leu Gly Cys Leu Phe 
Ala 

825 830 835 

840 

Pro Lys Val His He He Leu Phe Gin Pro Gin Lys Asn Val Val Thr His Arg Leu 
His 

845 850 855 

860 
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Leu Asn Arg Phe Ser Val Ser Gly Thr Gly Thr Thr Tyr Ser Gin Ser Ser Ala Ser 
Thr 

865 870 875 

880 

Tyr Val Pro Thr Val Cys Asn Gly Arg Glu Val Leu Asp Ser Thr Thr Ser Ser Leu 
Ter 

Seq ZD 39: V698G 

5 * 10 15 

Met Lys Met Leu Thr Arg Leu Gin Val Leu Thr Leu Ala Leu Phe Ser Lys Gly Phe 
Leu 

25 30 35 

40 

Leu Ser Leu Gly Asp His Asn Phe Leu Arg Arg Glu lie Lys lie Glu Gly Asp Leu 
Val 

45 50 55 

60 

Leu Gly Gly Leu Phe Pro lie Asn Glu Lys Gly Thr Gly Thr Glu Glu Cys Gly Arg 
lie 

65 70 75 

80 

Asn Glu Asp Arg Gly lie Gin Arg Leu Glu Ala Met Leu Phe Ala lie Asp Glu lie 
Asn 

85 90 95 

100 

Lys Asp Asp Tyr Leu Leu Pro Gly Val Lys Leu Gly Val His lie Leu Asp Thr Cys 
Ser 

105 110 115 

120 

Arg Asp Thr Tyr Ala Leu Glu Gin Ser Leu Glu Phe Val Arg Ala Ser Leu Thr Lys 
Val 

125 130 135 

140 

Asp Glu Ala Glu Tyr Met Cys Pro Asp Gly Ser Tyr Ala lie Gin Glu Asn lie Pro 
Leu 

145 150 155 

160 

Leu lie Ala Gly Val lie Gly Gly Ser Tyr Ser Ser Val Ser lie Gin Val Ala Asn 
Leu 

165 170 175 

180 

Leu Arg Leu Phe Gin lie Pro Gin lie Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser 
Asp 

185 190 195 

200 

Lys Ser Arg Tyr Asp Tyr Phe Ala Arg Thr Val Pro Pro Asp Phe Tyr Gin Ala Lys 
Ala 
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205 



210 



215 



220 



Met Ala Glu lie Leu Arg Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu 
Gly 



225 



230 



235 



240 



Asp Tyr Gly Glu Thr Gly lie Glu Ala Phe Glu Gin Glu Ala Arg Leu Arg Asn He 
Cys 



245 



250 



255 



260 



He Ala Thr Ala Glu Lys Val Gly Arg Ser Asn He Arg Lys Ser Tyr Asp Ser Val 
He 



265 



270 



275 



280 

Arg Glu Leu Leu Gin Lys Pro Asn Ala Arg Val Val Val Leu Phe Met Arg Ser Asp 
Asp 



285 290 295 

300 

Ser Arg Glu Leu He Ala Ala Ala Ser Arg Ala Asn Ala Ser Phe Thr Trp Val Ala 
Ser 

305 310 315 

320 

Asp Gly Trp Gly Ala Gin Glu Ser He He Lys Gly Ser Glu His Val Ala Tyr Gly 
Asp 

325 330 335 

340 

He Thr Leu Glu Leu Ala Ser Gin Pro Val Arg Gin Phe Gly Arg Tyr Phe Gin Ser 
Leu 

345 350 355 

360 

Asn Pro Tyr Asn Asn His Arg Asn Pro Trp Phe Arg Asp Phe Trp Glu Gin Lys Phe 
Gin 

365 370 375 

380 

Cys Ser Leu Gin Asn Lys Arg Asn His Arg Arg Val Cys Glu Lys His Leu Ala He 
Asp 

385 390 395 

400 

Ser Ser Asn Tyr Glu Gin Glu Ser Lys He Met Phe Val Val Asn Ala Val Tyr Ala 
Met 

405 410 415 

420 

Ala His Ala Leu His Lys Met Gin Arg Thr Leu Cys Pro Asn Thr Thr Lys Leu Cys 
Asp 

425 430 435 

. 440 

Ala Met Lys He Leu Asp Gly Lys Lys Leu Tyr Lys Asp Tyr Leu Leu Lys He Asn 
Phe 
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445 450 455 

460 

Thr Ala Pro Phe Asn Pro Asn Lys Asp Ala Asp Ser lie Val Lys Phe Asp Thr Phe 
Gly 

465 470 475 

480 

Asp Gly Met Gly Arg Tyr Asn Val Phe Asn Phe Gin Asn Val Gly Gly Lys Tyr Ser 
Tyr 

485 490 495 

500 

Leu Lys Val Gly His Trp Ala Glu Thr Leu Ser Leu Asp Val Asn Ser lie His Trp 
Ser 

505 510 515 

520 

Arg Asn Ser Val Pro Thr Ser Gin Cys Ser Asp Pro Cys Ala Pro Asn Glu Met Lys 
Asn 

525 530 535 

540 

Met Gin Pro Gly Asp Val Cys Cys Trp He Cys He Pro Cys Glu Pro Tyr Glu Tyr 
Leu 

545 550 555 

560 

Ala Asp Glu Phe Thr Cys Met Asp Cys Gly Ser Gly Gin Trp Pro Thr Ala Asp Leu 
Thr 

565 570 575 

580 

Gly Cys Tyr Asp Leu Pro Glu Asp Tyr He Arg Trp Glu Asp Ala Trp Ala He Gly 
Pro 

585 590 595 

600 

Val Thr He Ala Cys Leu Gly Phe Met Cys Thr Cys Met Val Val Thr Val Phe He 
Lys 

605 610 615 

620 

His Asn Asn Thr Pro Leu Val Lys Ala Ser Gly Arg Glu Leu Cys Tyr He Leu Leu 
Phe 

625 630 635 

640 

Gly Val Gly Leu Ser Tyr Cys Met Thr Phe Phe Phe He Ala Lys Pro Ser Pro Val 
He 

645 650 655 

660 

Cys Ala Leu Arg Arg Leu Gly Leu Gly Ser Ser Phe Ala He Cys Tyr Ser Ala Leu 
Leu 

665 670 675 

680 

Thr Lys Thr Asn Cys He Ala Arg He Phe Asp Gly Val Lys Asn Gly Ala Gin Arg 
Pro 
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685 690 695 

700 

Lys Phe lie Ser Pro Ser Ser Gin Val Phe lie Cys Leu Gly Leu lie Leu Gly Gin 
lie 

705 710 715 

720 

Val Met Val Ser Val Trp Leu lie Leu Glu Ala Pro Gly Thr Arg Arg Tyr Thr Leu 
Ala 

725 730 735 

740 

Glu Lys Arg Glu Thr Val He Leu Lys Cys Asn Val Lys Asp Ser Ser Met Leu He 
Ser 

745 750 755 

760 

Leu Thr Tyr Asp Val He Leu Val He Leu Cys Thr Val Tyr Ala Phe Lys Thr Arg 
Lys 

765 770 775 

780 

Cys Pro Glu Asn Phe Asn Glu Ala Lys Phe He Gly Phe Thr Met Tyr Thr Thr Cys 
He 

785 790 795 

800 

He Trp Leu Ala Phe Leu Pro He Phe Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin 
Thr 

805 810 815 

820 

Thr Thr Met Cys He Ser Val Ser Leu Ser Gly Phe Val Val Leu Gly Cys Leu Phe 
Ala 

825 830 835 

840 

Pro Lys Val His He He Leu Phe Gin Pro Gin Lys Asn Val Val Thr His Arg Leu 
His 

845 850 855 

860 

Leu Asn Arg Phe Ser Val Ser Gly Thr Gly Thr Thr Tyr Ser Gin Ser Ser Ala Ser 
Thr 

865 870 875 

880 

Tyr Val Pro Thr Val Cys Asn Gly Arg Glu Val Leu Asp Ser Thr Thr Ser Ser Leu 
Ter 

Seq ID 40: Z.697S, V6986, T742A, D744II 

5 10 15 

Met Lys Met Leu Thr Arg Leu Gin Val Leu Thr Leu Ala Leu Phe Ser Lys Gly Phe 
Leu 

25 30 35 

40 

Leu Ser Leu Gly Asp His Asn Phe Leu Arg Arg Glu He Lys He Glu Gly Asp Leu 
val 
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45 50 55 

60 

Leu Gly Gly Leu Phe Pro lie Asn Glu Lys Gly Thr Gly Thr Glu Glu Cys Gly Arg 
lie 

65 70 75 

80 

Asn Glu Asp Arg Gly lie Gin Arg Leu Glu Ala Met Leu Phe Ala He Asp Glu He 
Asn 

85 90 95 

100 

Lys Asp Asp Tyr Leu Leu Pro Gly Val Lys Leu Gly Val His He Leu Asp Thr Cys 
Ser 

105 110 115 

120 

Arg Asp Thr Tyr Ala Leu Glu Gin Ser Leu Glu Phe Val Arg Ala Ser Leu Thr Lys 
Val 

125 130 135 

140 

Asp Glu Ala Glu Tyr Met Cys Pro Asp Gly Ser Tyr Ala He Gin Glu Asn He Pro 
Leu 



145 150 155 

160 

Leu He Ala Gly Val He Gly Gly Ser Tyr Ser Ser Val Ser He Gin Val Ala Asn 
Leu 

165 170 175 

180 

Leu Arg Leu Phe Gin He Pro Gin He Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser 
Asp 

185 190 195 

200 

Lys Ser Arg Tyr Asp Tyr Phe Ala Arg Thr Val Pro Pro Asp Phe Tyr Gin Ala Lys 
Ala 

205 210 215 

220 

Met Ala Glu He Leu Arg Phe Phe Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu 
Gly 

225 230 . 235 

240 

Asp Tyr Gly Glu Thr Gly He Glu Ala Phe Glu Gin Glu Ala Arg Leu Arg Asn He 
Cys 

245 250 255 

260 

He Ala Thr Ala Glu Lys Val Gly Arg Ser Asn He Arg Lys Ser Tyr Asp Ser Val 
He 

265 270 275 

280 

Arg Glu Leu Leu Gin Lys Pro Asn Ala Arg Val Val Val Leu Phe Met Arg Ser Asp 
Asp 
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285 290 295 

300 

Ser Arg Glu Leu He Ala Ala Ala Ser Arg Ala Asn Ala Ser Phe Thr Trp Val Ala 
Ser 

305 310 315 

320 

Asp Gly Trp Gly Ala Gin Glu Ser He He Dys Gly Ser Glu His Val Ala Tyr Gly 
Asp 

325 330 335 

340 

He Thr Leu Glu Leu Ala Ser Gin Pro Val Arg Gin Phe Gly Arg Tyr Phe Gin Ser 
Leu 

345 350 355 

360 

Asn Pro Tyr Asn Asn His Arg Asn Pro Trp Phe Arg Asp Phe Trp Glu Gin Lys Phe 
Gin 

365 370 375 

380 

Cys Ser Leu Gin Asn Lys Arg Asn His Arg Arg Val Cys Glu Lys His Leu Ala He 
Asp 

385 390 395 

400 

Ser Ser Asn Tyr Glu Gin Glu Ser Lys He Met Phe Val Val Asn Ala Val Tyr Ala 
Met 

405 410 415 

420 

Ala His Ala Leu His Lys Met Gin Arg Thr Leu Cys Pro Asn Thr Thr Lys Leu Cys 
Asp 

425 430 435 

440 

Ala Met Lys He Leu Asp Gly Lys Lys Leu Tyr Lys Asp Tyr Leu Leu Lys He Asn 
Phe 

445 450 455 

460 

Thr Ala Pro Phe Asn Pro Asn Lys Asp Ala Asp Ser He Val Lys Phe Asp Thr Phe 
Gly 

465 470 475 

480 

Asp Gly Met Gly Arg Tyr Asn Val Phe Asn Phe Gin Asn Val Gly Gly Lys Tyr Ser 
Tyr 

485 490 495 

500 

Leu Lys Val Gly His Tzrp Ala Glu Thr Leu Ser Leu Asp Val Asn Ser He His Trp 
Ser 

505 510 515 

520 

Arg Asn Ser Val Pro Thr Ser Gin Cys Ser Asp Pro Cys Ala Pro Asn Glu Met Lys 
Asn 
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525 530 535 

540 

Met Gin Pro Gly Asp Val Cys Cys Trp lie Cys lie Pro Cys Glu Pro Tyr Glu Tyr 
Leu 

545 550 555 

560 

Ala Asp Glu Phe Thr Cys Met Asp Cys Gly Ser Gly Gin Trp Pro Thr Ala Asp Leu 
Thr 

565 570 575 

580 

Gly Cys Tyr Asp Leu Pro Glu Asp Tyr lie Arg Trp Glu Asp Ala Trp Ala lie Gly 
Pro 

585 590 595 

600 

Val Thr lie Ala Cys Leu Gly Phe Met Cys Thr Cys Met Val Val Thr Val Phe lie 
Lys 

605 610 615 

620 

His Asn Asn Thr Pro Leu Val Lys Ala Ser Gly Arg Glu Leu Cys Tyr He Leu Leu 
Phe 

625 630 635 

640 

Gly Val Gly Leu Ser Tyr Cys Met Thr Phe Phe Phe He Ala Lys Pro Ser Pro Val 
He 

645 650 655 

660 

Cys Ala Leu Arg Arg Leu Gly Leu Gly Ser Ser Phe Ala He Cys Tyr Ser Ala Leu 
Leu 

665 670 675 

680 

Thr Lys Thr Asn Cys He Ala Arg He Phe Asp Gly Val Lys Asn Gly Ala Gin Arg 
Pro 

685 690 695 

700 

Lys Phe He Ser Pro Ser Ser Gin Val Phe He Cys Leu Gly Leu He Ser Gly Gin 
He 

705 710 715 

720 

Val Met Val Ser Val Trp Leu He Leu Glu Ala Pro Gly Thr Arg Arg Tyr Thr Leu 
Ala 

725 730 735 

740 

Glu Lys Arg Glu Thr Val He Leu Lys Cys Asn Val Lys Asp Ser Ser Met Leu He 
Ser 

745 750 755 

760 

Leu Ala Tyr Asn Val He Leu Val He Leu Cys Thr Val Tyr Ala Phe Lys Thr Arg 
Lys 
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765 770 775 

780 

Cys Pro Glu Asn Phe Asn Glu Ala Lys Phe lie Gly Phe Thr Met Tyr Thr Thr Cys 
lie 

785 790 795 

800 

lie Trp Leu Ala Phe Leu Pro lie Phe Tyr Val Thr Ser Ser Asp Tyr Arg Val Gin 
Thr 

805 810 815 

820 

Thr Thr Met Cys lie Ser Val Ser Leu Ser Gly Phe Val Val Leu Gly Cys Leu Phe 
Ala 

825 830 835 

840 

Pro Lys Val His lie lie Leu Phe Gin Pro Gin Lys Asn Val Val Thr His Arg Leu 
His 

845 850 855 

860 

Leu Asn Arg Phe Ser Val Ser Gly Thr Gly Thr Thr Tyr Ser Gin Ser Ser Ala Ser 
Thr 

865 870 875 

880 

Tyr Val Pro Thr Val Cys Asn Gly Arg Glu Val Leu Asp Ser Thr Thr Ser Ser Leu 
Ter 
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Seq ID 41: D744N 

1 atgaagatgt tgacaagact gcaagttctt accttagctt tgttttcaaa 
51 gggattttta ctctctttag gggaccataa ctttctaagg agagagatta 
101 aaatagaagg tgaccttgtt ttagggggcc tgtttcctat taacgaaaaa 
151 ggcactggaa ctgaagaatg tgggcgaatc aatgaagacc gagggattca 
201 acgcctggaa gccatgttgt ttgctattga tgaaatcaac aaagatgatt 
251 acttgctacc aggagtgaag ttgggtgttc acattttgga tacatgttca 
301 agggatacct atgcattgga gcaatcactg gagtttgtca gggcatcttt 
351 gacaaaagtg gatgaagctg agtatatgtg tcctgatgga tcctatgcca 
501 ttcaagaaaa catcccactt ctcattgcag gggtcattgg tggctcttat 
551 agcagtgttt ccatacaggt ggcaaacctg ctgcggctct tccagatccc 
501 tcagatcagc tacgcatcca ccagcgccaa actcagtgat aagtcgcgct 
551 atgattactt tgccaggacc gtgccccccg acttctacca ggccaaagcc 
601 atggctgaga tcttgcgctt cttcaactgg acctacgtgt ccacagtagc 
651 ctccgagggt gattacgggg agacagggat cgaggccttc gagcaggaag 
701 cccgcctgcg caacatctgc atcgctacgg cggagaaggt gggccgctcc 
751 aacatccgca agtcctacga cagcgtgatc cgagaactgt tgcagaagcc 
801 caacgcgcgc gtcgtggtcc tcttcatgcg cagcgacgac tcgcgggagc 
851 tcattgcagc cgccagccgc gccaatgcct ccttcacctg ggtggccagc 
901 gacggttggg gcgcgcagga gagcatcatc aagggcagcg agcatgtggc 
951 ctacggcgac atcaccctgg agctggcctc ccagcctgtc cgccagttcg 
1001 gccgctactt ccagagcctc aacccctaca acaaccaccg caacccctgg 
1051 ttccgggact tctgggagca aaagtttcag tgcagcctcc agaacaaacg 
1101 caaccacagg cgcgtctgcg aaaagcacct ggccatcgac agcagcaact 
1151 acgagcaaga gtccaagatc atgtttgtgg tgaacgcggt gtatgccatg 
1201 gcccacgctt tgcacaaaat gcagcgcacc ctctgtccca acactaccaa 
1251 gctttgtgat gctatgaaga tcctggatgg gaagaagttg tacaaggatt 
1301 acttgctgaa aatcaacttc acggctccat tcaacccaaa taaagatgca 
1351 gatagcatag tcaagtttga cacttttgga gatggaatgg ggcgatacaa 
1501 cgtgttcaat ttccaaaatg taggtgggaa gtattcctac ttgaaagttg 
1551 gtcactgggc agaaacctta tcgctagatg tcaactctat ccactggtcc 
1501 cggaactcag tccccacttc ccagtgcagc gacccctgtg cccccaatga 
1551 aatgaagaat atgcaaccag gggatgtctg ctgctggatt tgcatcccct 
1601 gtgaacccta cgaatacctg gctgatgagt ttacctgtat ggattgtggg 
1651 tctggacagt ggcccactgc agacctaact ggatgctatg accttcctga 
1701 ggactacatc aggtgggaag acgcctgggc cattggccca gtcaccattg 
1751 cctgtctggg ttttatgtgt acatgcatgg ttgtaactgt ttttatcaag 
1801 cacaacaaca cacccttggt caaagcatcg ggccgagaac tctgctacat 
1851 cttattgttt ggggttggcc tgtcatactg catgacattc ttcttcattg 
1901 ccaagccatc accagtcatc tgtgcattgc gccgactcgg gctggggagt 
1951 tccttcgcta tctgttactc agccctgctg accaagacaa actgcattgc 
2001 ccgcatcttc gatggggtca agaatggcgc tcagaggcca aaattcatca 
2051 gccccagttc tcaggttttc atctgcctgg gtctgatcct ggtgcaaatt 
2101 gtgatggtgt ctgtgtggct catcctggag gccccaggca ccaggaggta 
2151 tacccttgca gagaagcggg aaacagtcat cctaaaatgc aatgtcaaag 
2201 attccagcat gttgatctct cttacctaca atgtgatcct ggtgatctta 
2251 tgcactgtgt acgccttcaa aacgcggaag tgcccagaaa atttcaacga 
2301 agctaagttc ataggtttta ccatgtacac cacgtgcatc atctggttgg 
2351 ccttcctccc tatattttat gtgacatcaa gtgactacag agtgcagacg 
2501 acaaccatgt gcatctctgt cagcctgagt ggctttgtgg tcttgggctg 
2551 tttgtttgca cccaaggttc acatcatcct gtttcaaccc cagaagaatg 
2501 ttgtcacaca cagactgcac ctcaacaggt tcagtgtcag tggaactggg 
2551 accacatact ctcagtcctc tgcaagcacg tatgtgccaa cggtgtgcaa 
2601 tgggcgggaa gtcctcgact ccaccacctc atctctgtga 



wo 2004/024936 
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Seq ID 42: V6986, D744N 

1 atgaagatgt tgacaagact gcaagttctt accttagctt tgttttcaaa 
51 gggattttta ctctctttag gggaccataa ctttctaagg agagagatta 
101 aaatagaagg tgaccttgtt ttagggggcc tgtttcctat taacgaaaaa 
151 ggcactggaa ctgaagaatg tgggcgaatc aatgaagacc gagggattca 
201 acgcctggaa gccatgttgt ttgctattga tgaaatcaac aaagatgatt 
251 acttgctacc aggagtgaag ttgggtgttc acattttgga tacatgttca 
301 agggatacct atgcattgga gcaatcactg gagtttgtca gggcatcttt 
351 gacaaaagtg gatgaagctg agtatatgtg tcctgatgga tcctatgcca 
501 ttcaagaaaa catcccactt ctcattgcag gggtcattgg tggctcttat 
551 agcagtgttt ccatacaggt ggcaaacctg ctgcggctct tccagatccc 
501 tcagatcagc tacgcatcca ccagcgccaa actcagtgat aagtcgcgct 
551 atgattactt tgccaggacc gtgccccccg acttctacca ggccaaagcc 
601 atggctgaga tcttgcgctt cttcaactgg acctacgtgt ccacagtagc 
651 ctccgagggt gattacgggg agacagggat cgaggccttc gagcaggaag 
701 cccgcctgcg caacatctgc atcgctacgg cggagaaggt gggccgctcc 
751 aacatccgca agtcctacga cagcgtgatc cgagaactgt tgcagaagcc 
801 caacgcgcgc gtcgtggtcc tcttcatgcg cagcgacgac tcgcgggagc 
851 tcattgcagc cgccagccgc gccaatgcct ccttcacctg ggtggccagc 
901 gacggttggg gcgcgcagga gagcatcatc aagggcagcg agcatgtggc 
951 ctacggcgac atcaccctgg agctggcctc ccagcctgtc cgccagttcg 
1001 gccgctactt ccagagcctc aacccctaca acaaccaccg caacccctgg 
1051 ttccgggact tctgggagca aaagtttcag tgcagcctcc agaacaaacg 
1101 caaccacagg cgcgtctgcg aaaagcacct ggccatcgac agcagcaact 
1151 acgagcaaga gtccaagatc atgtttgtgg tgaacgcggt gtatgccatg 
1201 gcccacgctt tgcacaaaat gcagcgcacc ctctgtccca acactaccaa 
1251 gctttgtgat gctatgaaga tcctggatgg gaagaagttg tacaaggatt 
1301 acttgctgaa aatcaacttc acggctccat tcaacccaaa taaagatgca 
1351 gatagcatag tcaagtttga cacttttgga gatggaatgg ggcgatacaa 
1501 cgtgttcaat ttccaaaatg taggtgggaa gtattcctac ttgaaagttg 
1551 gtcactgggc agaaacctta tcgctagatg tcaactctat ccactggtcc 
1501 cggaactcag tccccacttc ccagtgcagc gacccctgtg cccccaatga 
1551 aatgaagaat atgcaaccag gggatgtctg ctgctggatt tgcatcccct 
1601 gtgaacccta cgaatacctg gctgatgagt ttacctgtat ggattgtggg 
1651 tctggacagt ggcccactgc agacctaact ggatgctatg accttcctga 
1701 ggactacatc aggtgggaag acgcctgggc cattggccca gtcaccattg 
1751 cctgtctggg ttttatgtgt acatgcatgg ttgtaactgt ttttatcaag 
1801 cacaacaaca cacccttggt caaagcatcg ggccgagaac tctgctacat 
1851 cttattgttt ggggttggcc tgtcatactg catgacattc ttcttcattg 
1901 ccaagccatc accagtcatc tgtgcattgc gccgactcgg gctggggagt 
1951 tccttcgcta tctgttactc agccctgctg accaagacaa actgcattgc 
2001 ccgcatcttc gatggggtca agaatggcgc tcagaggcca aaattcatca 
2051 gccccagttc tcaggttttc atctgcctgg gtctgatcct ggggcaaatt 
2101 gtgatggtgt ctgtgtggct catcctggag gccccaggca ccaggaggta 
2151 tacccttgca gagaagcggg aaacagtcat cctaaaatgc aatgtcaaag 
2201 attccagcat gttgatctct cttacctaca atgtgatcct ggtgatctta 
2251 tgcactgtgt acgccttcaa aacgcggaag tgcccagaaa atttcaacga 
2301 agctaagttc ataggtttta ccatgtacac cacgtgcatc atctggttgg 
2351 ccttcctccc tatattttat gtgacatcaa gtgactacag agtgcagacg 
2501 acaaccatgt gcatctctgt cagcctgagt ggctttgtgg tcttgggctg 
2551 tttgtttgca cccaaggttc acatcatcct gtttcaaccc cagaagaatg 
2501 ttgtcacaca cagactgcac ctcaacaggt tcagtgtcag tggaactggg 
2551 accacatact ctcagtcctc tgcaagcacg tatgtgccaa cggtgtgcaa 
2601 tgggcgggaa gtcctcgact ccaccacctc atctctgtga 
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Seg ID 43: ZI697S, V6986, D744N 

1 atgaagatgt tgacaagact gcaagttctt accttagctt tgttttcaaa 
51 gggattttta ctctctttag gggaccataa ctttctaagg agagagatta 
101 aaatagaagg tgaccttgtt ttagggggcc tgtttcctat taacgaaaaa 
151 ggcactggaa ctgaagaatg tgggcgaatc aatgaagacc gagggattca 
201 acgcctggaa gccatgttgt ttgctattga tgaaatcaac aaagatgatt 
251 acttgctacc aggagtgaag ttgggtgttc acattttgga tacatgttca 
301 agggatacct atgcattgga gcaatcactg gagtttgtca gggcatcttt 
351 gacaaaagtg gatgaagctg agtatatgtg tcctgatgga tcctatgcca 
501 ttcaagaaaa catcccactt ctcattgcag gggtcattgg tggctcttat 
551 agcagtgttt ccatacaggt ggcaaacctg ctgcggctct tccagatccc 
501 tcagatcagc tacgcatcca ccagcgccaa actcagtgat aagtcgcgct 
551 atgattactt tgccaggacc gtgccccccg acttctacca ggccaaagcc 
601 atggctgaga tcttgcgctt cttcaactgg acctacgtgt ccacagtagc 
651 ctccgagggt gattacgggg agacagggat cgaggccttc gagcaggaag 
701 cccgcctgcg caacatctgc atcgctacgg cggagaaggt gggccgctcc 
751 aacatccgca agtcctacga cagcgtgatc cgagaactgt tgcagaagcc 
801 caacgcgcgc gtcgtggtcc tcttcatgcg cagcgacgac tcgcgggagc 
851 tcattgcagc cgccagccgc gccaatgcct ccttcacctg ggtggccagc 
901 gacggttggg gcgcgcagga gagcatcatc aagggcagcg agcatgtggc 
951 ctacggcgac atcaccctgg agctggcctc ccagcctgtc cgccagttcg 
1001 gccgctactt ccagagcctc aacccctaca acaaccaccg caacccctgg 
1051 ttccgggact tctgggagca aaagtttcag tgcagcctcc agaacaaacg 
1101 caaccacagg cgcgtctgcg aaaagcacct ggccatcgac agcagcaact 
1151 acgagcaaga gtccaagatc atgtttgtgg tgaacgcggt gtatgccatg 
1201 gcccacgctt tgcacaaaat gcagcgcacc ctctgtccca acactaccaa 
1251 gctttgtgat gctatgaaga tcctggatgg gaagaagttg tacaaggatt 
1301 acttgctgaa aatcaacttc acggctccat tcaacccaaa taaagatgca 
1351 gatagcatag tcaagtttga cacttttgga gatggaatgg ggcgatacaa 
1501 cgtgttcaat ttccaaaatg taggtgggaa gtattcctac ttgaaagttg 
1551 gtcactgggc agaaacctta tcgctagatg tcaactctat ccactggtcc 
1501 cggaactcag tccccacttc ccagtgcagc gacccctgtg cccccaatga 
1551 aatgaagaat atgcaaccag gggatgtctg ctgctggatt tgcatcccct 
1601 gtgaacccta cgaatacctg gctgatgagt ttacctgtat ggattgtggg 
1651 tctggacagt ggcccactgc agacctaact ggatgctatg accttcctga 
1701 ggactacatc aggtgggaag acgcctgggc cattggccca gtcaccattg 
1751 cctgtctggg ttttatgtgt acatgcatgg ttgtaactgt ttttatcaag 
1801 cacaacaaca cacccttggt caaagcatcg ggccgagaac tctgctacat 
1851 cttattgttt ggggttggcc tgtcatactg catgacattc ttcttcattg 
1901 ccaagccatc accagtcatc tgtgcattgc gccgactcgg gctggggagt 
1951 tccttcgcta tctgttactc agccctgctg accaagacaa actgcattgc 
2001 ccgcatcttc gatggggtca agaatggcgc tcagaggcca aaattcatca 
2051 gccccagttc tcaggttttc atctgcctgg gtctgatctc ggggcaaatt 
2101 gtgatggtgt ctgtgtggct catcctggag gccccaggca ccaggaggta 
2151 tacccttgca gagaagcggg aaacagtcat cctaaaatgc aatgtcaaag 
2201 attccagcat gttgatctct cttacctaca atgtgatcct ggtgatctta 
2251 tgcactgtgt acgccttcaa aacgcggaag tgcccagaaa atttcaacga 
2301 agctaagttc ataggtttta ccatgtacac cacgtgcatc atctggttgg 
2351 ccttcctccc tatattttat gtgacatcaa gtgactacag agtgcagacg 
2501 acaaccatgt gcatctctgt cagcctgagt ggctttgtgg tcttgggctg 
2551 tttgtttgca cccaaggttc acatcatcct gtttcaaccc cagaagaatg 
2501 ttgtcacaca cagactgcac ctcaacaggt tcagtgtcag tggaactggg 
2551 accacatact ctcagtcctc tgcaagcacg tatgtgccaa cggtgtgcaa 
2601 tgggcgggaa gtcctcgact ccaccacctc atctctgtga 
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Seq ID 44: Ii697S 

1 atgaagatgt tgacaagact gcaagttctt accttagctt tgttttcaaa 
51 gggattttta ctctctttag gggaccataa ctttctaagg agagagatta 
101 aaatagaagg tgaccttgtt ttagggggcc tgtttcctat taacgaaaaa 
151 ggcactggaa ctgaagaatg tgggcgaatc aatgaagacc gagggattca 
201 acgcctggaa gccatgttgt ttgctattga tgaaatcaac aaagatgatt 
251 acttgctacc aggagtgaag ttgggtgttc acattttgga tacatgttca 
301 agggatacct atgcattgga gcaatcactg gagtttgtca gggcatcttt 
351 gacaaaagtg gatgaagctg agtatatgtg tcctgatgga tcctatgcca 
501 ttcaagaaaa catcccactt ctcattgcag gggtcattgg tggctcttat 
551 agcagtgttt ccatacaggt ggcaaacctg ctgcggctct tccagatccc 
501 tcagatcagc tacgcatcca ccagcgccaa actcagtgat aagtcgcgct 
551 atgattactt tgccaggacc gtgccccccg acttctacca ggccaaagcc 
601 atggctgaga tcttgcgctt cttcaactgg acctacgtgt ccacagtagc 
651 ctccgagggt gattacgggg agacagggat cgaggccttc gagcaggaag 
701 cccgcctgcg caacatctgc atcgctacgg cggagaaggt gggccgctcc 
751 aacatccgca agtcctacga cagcgtgatc cgagaactgt tgcagaagcc 
801 caacgcgcgc gtcgtggtcc tcttcatgcg cagcgacgac tcgcgggagc 
851 tcattgcagc cgccagccgc gccaatgcct ccttcacctg ggtggccagc 
901 gacggttggg gcgcgcagga gagcatcatc aagggcagcg agcatgtggc 
951 ctacggcgac atcaccctgg agctggcctc ccagcctgtc cgccagttcg 
1001 gccgctactt ccagagcctc aacccctaca acaaccaccg caacccctgg 
1051 ttccgggact tctgggagca aaagtttcag tgcagcctcc agaacaaacg 
1101 caaccacagg cgcgtctgcg aaaagcacct ggccatcgac agcagcaact 
1151 acgagcaaga gtccaagatc atgtttgtgg tgaacgcggt gtatgccatg 
1201 gcccacgctt tgcacaaaat gcagcgcacc ctctgtccca acactaccaa 
1251 gctttgtgat gctatgaaga tcctggatgg gaagaagttg tacaaggatt 
1301 acttgctgaa aatcaacttc acggctccat tcaacccaaa taaagatgca 
1351 gatagcatag tcaagtttga cacttttgga gatggaatgg ggcgatacaa 
1501 cgtgttcaat ttccaaaatg taggtgggaa gtattcctac ttgaaagttg 
1551 gtcactgggc agaaacctta tcgctagatg tcaactctat ccactggtcc 
1501 cggaactcag tccccacttc ccagtgcagc gacccctgtg cccccaatga 
1551 aatgaagaat atgcaaccag gggatgtctg ctgctggatt tgcatcccct 
1601 gtgaacccta cgaatacctg gctgatgagt ttacctgtat ggattgtggg 
1651 tctggacagt ggcccactgc agacctaact ggatgctatg accttcctga 
1701 ggactacatc aggtgggaag acgcctgggc cattggccca gtcaccattg 
1751 cctgtctggg ttttatgtgt acatgcatgg ttgtaactgt ttttatcaag 
1801 cacaacaaca cacccttggt caaagcatcg ggccgagaac tctgctacat 
1851 cttattgttt ggggttggcc tgtcatactg catgacattc ttcttcattg 
1901 ccaagccatc accagtcatc tgtgcattgc gccgactcgg gctggggagt 
1951 tccttcgcta tctgttactc agccctgctg accaagacaa actgcattgc 
2001 ccgcatcttc gatggggtca agaatggcgc tcagaggcca aaattcatca 
2051 gccccagttc tcaggttttc atctgcctgg gtctgatctc ggtgcaaatt 
2101 gtgatggtgt ctgtgtggct catcctggag gccccaggca ccaggaggta 
2151 tacccttgca gagaagcggg aaacagtcat cctaaaatgc aatgtcaaag 
2201 attccagcat gttgatctct cttacctacg atgtgatcct ggtgatctta 
2251 tgcactgtgt acgccttcaa aacgcggaag tgcccagaaa atttcaacga 
2301 agctaagttc ataggtttta ccatgtacac cacgtgcatc atctggttgg 
2351 ccttcctccc tatattttat gtgacatcaa gtgactacag agtgcagacg 
2501 acaaccatgt gcatctctgt cagcctgagt ggctttgtgg tcttgggctg 
2551 tttgtttgca cccaaggttc acatcatcct gtttcaaccc cagaagaatg 
2501 ttgtcacaca cagactgcac ctcaacaggt tcagtgtcag tggaactggg 
2551 accacatact ctcagtcctc tgcaagcacg tatgtgccaa cggtgtgcaa 
2601 tgggcgggaa gtcctcgact ccaccacctc atctctgtga 
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Seq ID 45: Ii697S, V6986 

1 atgaagatgt tgacaagact gcaagttctt accttagctt tgttttcaaa 
51 gggattttta ctctctttag gggaccataa ctttctaagg agagagatta 
101 aaatagaagg tgaccttgtt ttagggggcc tgtttcctat taacgaaaaa 
151 ggcactggaa ctgaagaatg tgggcgaatc aatgaagacc gagggattca 
201 acgcctggaa gccatgttgt ttgctattga tgaaatcaac aaagatgatt 
251 acttgctacc aggagtgaag ttgggtgttc acattttgga tacatgttca 
301 agggatacct atgcattgga gcaatcactg gagtttgtca gggcatcttt 
351 gacaaaagtg gatgaagctg agtatatgtg tcctgatgga tcctatgcca 
501 ttcaagaaaa catcccactt ctcattgcag gggtcattgg tggctcttat 
551 agcagtgttt ccatacaggt ggcaaacctg ctgcggctct tccagatccc 
501 tcagatcagc tacgcatcca ccagcgccaa actcagtgat aagtcgcgct 
551 atgattactt tgccaggacc gtgccccccg acttctacca ggccaaagcc 
601 atggctgaga tcttgcgctt cttcaactgg acctacgtgt ccacagtagc 
651 ctccgagggt gattacgggg agacagggat cgaggccttc gagcaggaag 
701 cccgcctgcg caacatctgc atcgctacgg cggagaaggt gggccgctcc 
751 aacatccgca agtcctacga cagcgtgatc cgagaactgt tgcagaagcc 
801 caacgcgcgc gtcgtggtcc tcttcatgcg cagcgacgac tcgcgggagc 
851 tcattgcagc cgccagccgc gccaatgcct ccttcacctg ggtggccagc 
901 gacggttggg gcgcgcagga gagcatcatc aagggcagcg agcatgtggc 
951 ctacggcgac atcaccctgg agctggcctc ccagcctgtc cgccagttcg 
1001 gccgctactt ccagagcctc aacccctaca acaaccaccg caacccctgg 
1051 ttccgggact tctgggagca aaagtttcag tgcagcctcc agaacaaacg 
1101 caaccacagg cgcgtctgcg aaaagcacct ggccatcgac agcagcaact 
1151 acgagcaaga gtccaagatc atgtttgtgg tgaacgcggt gtatgccatg 
1201 gcccacgctt tgcacaaaat gcagcgcacc ctctgtccca acactaccaa 
1251 gctttgtgat gctatgaaga tcctggatgg gaagaagttg tacaaggatt 
1301 acttgctgaa aatcaacttc acggctccat tcaacccaaa taaagatgca 
1351 gatagcatag tcaagtttga cacttttgga gatggaatgg ggcgatacaa 
1501 cgtgttcaat ttccaaaatg taggtgggaa gtattcctac ttgaaagttg 
1551 gtcactgggc agaaacctta tcgctagatg tcaactctat ccactggtcc 
1501 cggaactcag tccccacttc ccagtgcagc gacccctgtg cccccaatga 
1551 aatgaagaat atgcaaccag gggatgtctg ctgctggatt tgcatcccct 
1601 gtgaacccta cgaatacctg gctgatgagt ttacctgtat ggattgtggg 
1651 tctggacagt ggcccactgc agacctaact ggatgctatg accttcctga 
1701 ggactacatc aggtgggaag acgcctgggc cattggccca gtcaccattg 
1751 cctgtctggg ttttatgtgt acatgcatgg ttgtaactgt ttttatcaag 
1801 cacaacaaca cacccttggt caaagcatcg ggccgagaac tctgctacat 
1851 cttattgttt ggggttggcc tgtcatactg catgacattc ttcttcattg 
1901 ccaagccatc accagtcatc tgtgcattgc gccgactcgg gctggggagt 
1951 tccttcgcta tctgttactc agccctgctg accaagacaa actgcattgc 
2001 ccgcatcttc gatggggtca agaatggcgc tcagaggcca aaattcatca 
2051 gccccagttc tcaggttttc atctgcctgg gtctgatctc ggggcaaatt 
2101 gtgatggtgt ctgtgtggct catcctggag gccccaggca ccaggaggta 
2151 tacccttgca gagaagcggg aaacagtcat cctaaaatgc aatgtcaaag 
2201 attccagcat gttgatctct cttacctacg atgtgatcct ggtgatctta 
2251 tgcactgtgt acgccttcaa aacgcggaag tgcccagaaa atttcaacga 
2301 agctaagttc ataggtttta ccatgtacac cacgtgcatc atctggttgg 
2351 ccttcctccc tatattttat gtgacatcaa gtgactacag agtgcagacg 
2501 acaaccatgt gcatctctgt cagcctgagt ggctttgtgg tcttgggctg 
2551 tttgtttgca cccaaggttc acatcatcct gtttcaaccc cagaagaatg 
2501 ttgtcacaca cagactgcac ctcaacaggt tcagtgtcag tggaactggg 
2551 accacatact ctcagtcctc tgcaagcacg tatgtgccaa cggtgtgcaa 
2601 tgggcgggaa gtcctcgact ccaccacctc atctctgtga 
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Seq ID 46: L697S, D744N 

1 atgaagatgt tgacaagact gcaagttctt accttagctt tgttttcaaa 
51 gggattttta ctctctttag gggaccataa ctttctaagg agagagatta 
101 aaatagaagg tgaccttgtt ttagggggcc tgtttcctat taacgaaaaa 
151 ggcactggaa ctgaagaatg tgggcgaatc aatgaagacc gagggattca 
201 acgcctggaa gccatgttgt ttgctattga tgaaatcaac aaagatgatt 
251 acttgctacc aggagtgaag ttgggtgttc acattttgga tacatgttca 
301 agggatacct atgcattgga gcaatcactg gagtttgtca gggcatcttt 
351 gacaaaagtg gatgaagctg agtatatgtg tcctgatgga tcctatgcca 
501 ttcaagaaaa catcccactt ctcattgcag gggtcattgg tggctcttat 
551 agcagtgttt ccatacaggt ggcaaacctg ctgcggctct tccagatccc 
501 tcagatcagc tacgcatcca ccagcgccaa actcagtgat aagtcgcgct 
551 atgattactt tgccaggacc gtgccccccg acttctacca ggccaaagcc 
601 atggctgaga tcttgcgctt cttcaactgg acctacgtgt ccacagtagc 
651 ctccgagggt gattacgggg agacagggat cgaggccttc gagcaggaag 
701 cccgcctgcg caacatctgc atcgctacgg cggagaaggt gggccgctcc 
751 aacatccgca agtcctacga cagcgtgatc cgagaactgt tgcagaagcc 
801 caacgcgcgc gtcgtggtcc tcttcatgcg cagcgacgac tcgcgggagc 
851 tcattgcagc cgccagccgc gccaatgcct ccttcacctg ggtggccagc 
901 gacggttggg gcgcgcagga gagcatcatc aagggcagcg agcatgtggc 
951 ctacggcgac atcaccctgg agctggcctc ccagcctgtc cgccagttcg 
1001 gccgctactt ccagagcctc aacccctaca acaaccaccg caacccctgg 
1051 ttccgggact tctgggagca aaagtttcag tgcagcctcc agaacaaacg 
1101 caaccacagg cgcgtctgcg aaaagcacct ggccatcgac agcagcaact 
1151 acgagcaaga gtccaagatc atgtttgtgg tgaacgcggt gtatgccatg 
1201 gcccacgctt tgcacaaaat gcagcgcacc ctctgtccca acactaccaa 
1251 gctttgtgat gctatgaaga tcctggatgg gaagaagttg tacaaggatt 
1301 acttgctgaa aatcaacttc acggctccat tcaacccaaa taaagatgca 
13 51 gatagcatag tcaagtttga cacttttgga gatggaatgg ggcgatacaa 
1501 cgtgttcaat ttccaaaatg taggtgggaa gtattcctac ttgaaagttg 
1551 gtcactgggc agaaacctta tcgctagatg tcaactctat ccactggtcc 
1501 cggaactcag tccccacttc ccagtgcagc gacccctgtg cccccaatga 
1551 aatgaagaat atgcaaccag gggatgtctg ctgctggatt tgcatcccct 
1601 gtgaacccta cgaatacctg gctgatgagt ttacctgtat ggattgtggg 
1651 tctggacagt ggcccactgc agacctaact ggatgctatg accttcctga 
1701 ggactacatc aggtgggaag acgcctgggc cattggccca gtcaccattg 
1751 cctgtctggg ttttatgtgt acatgcatgg ttgtaactgt ttttatcaag 
1801 cacaacaaca cacccttggt caaagcatcg ggccgagaac tctgctacat 
1851 cttattgttt ggggttggcc tgtcatactg catgacattc ttcttcattg 
1901 ccaagccatc accagtcatc tgtgcattgc gccgactcgg gctggggagt 
1951 tccttcgcta tctgttactc agccctgctg accaagacaa actgcattgc 
2001 ccgcatcttc gatggggtca agaatggcgc tcagaggcca aaattcatca 
2051 gccccagttc tcaggttttc atctgcctgg gtctgatctc ggtgcaaatt 
2101 gtgatggtgt ctgtgtggct catcctggag gccccaggca ccaggaggta 
2151 tacccttgca gagaagcggg aaacagtcat cctaaaatgc aatgtcaaag 
2201 attccagcat gttgatctct cttacctaca atgtgatcct ggtgatctta 
2251 tgcactgtgt acgccttcaa aacgcggaag tgcccagaaa atttcaacga 
2301 agctaagttc ataggtttta ccatgtacac cacgtgcatc atctggttgg 
2351 ccttcctccc tatattttat gtgacatcaa gtgactacag agtgcagacg 
2501 acaaccatgt gcatctctgt cagcctgagt ggctttgtgg tcttgggctg 
2551 tttgtttgca cccaaggttc acatcatcct gtttcaaccc cagaagaatg 
2501 ttgtcacaca cagactgcac ctcaacaggt tcagtgtcag tggaactggg 
2551 accacatact ctcagtcctc tgcaagcacg tatgtgccaa cggtgtgcaa 
2601 tgggcgggaa gtcctcgact ccaccacctc atctctgtga 
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Seg ID 47: V698G 

1 atgaagatgt tgacaagact gcaagttctt accttagctt tgttttcaaa 
51 gggattttta ctctctttag gggaccataa ctttctaagg agagagatta 
101 aaatagaagg tgaccttgtt ttagggggcc tgtttcctat taacgaaaaa 
151 ggcactggaa ctgaagaatg tgggcgaatc aatgaagacc gagggattca 
201 acgcctggaa gccatgttgt ttgctattga tgaaatcaac aaagatgatt 
251 acttgctacc aggagtgaag ttgggtgttc acattttgga tacatgttca 
301 agggatacct atgcattgga gcaatcactg gagtttgtca gggcatcttt 
351 gacaaaagtg gatgaagctg agtatatgtg tcctgatgga tcctatgcca 
501 ttcaagaaaa catcccactt ctcattgcag gggtcattgg tggctcttat 
551 agcagtgttt ccatacaggt ggcaaacctg ctgcggctct tccagatccc 
501 tcagatcagc tacgcatcca ccagcgccaa actcagtgat aagtcgcgct 
551 atgattactt tgccaggacc gtgccccccg acttctacca ggccaaagcc 
601 atggctgaga tcttgcgctt cttcaactgg acctacgtgt ccacagtagc 
651 ctccgagggt gattacgggg agacagggat cgaggccttc gagcaggaag 
701 cccgcctgcg caacatctgc atcgctacgg cggagaaggt gggccgctcc 
751 aacatccgca agtcctacga cagcgtgatc cgagaactgt tgcagaagcc 
801 caacgcgcgc gtcgtggtcc tcttcatgcg cagcgacgac tcgcgggagc 
851 tcattgcagc cgccagccgc gccaatgcct ccttcacctg ggtggccagc 
901 gacggttggg gcgcgcagga gagcatcatc aagggcagcg agcatgtggc 
951 ctacggcgac atcaccctgg agctggcctc ccagcctgtc cgccagttcg 
1001 gccgctactt ccagagcctc aacccctaca acaaccaccg caacccctgg 
1051 ttccgggact tctgggagca aaagtttcag tgcagcctcc agaacaaacg 
1101 caaccacagg cgcgtctgcg aaaagcacct ggccatcgac agcagcaact 
1151 acgagcaaga gtccaagatc atgtttgtgg tgaacgcggt gtatgccatg 
1201 gcccacgctt tgcacaaaat gcagcgcacc ctctgtccca acactaccaa 
1251 gctttgtgat gctatgaaga tcctggatgg gaagaagttg tacaaggatt 
1301 acttgctgaa aatcaacttc acggctccat tcaacccaaa taaagatgca 
1351 gatagcatag tcaagtttga cacttttgga gatggaatgg ggcgatacaa 
1501 cgtgttcaat ttccaaaatg taggtgggaa gtattcctac ttgaaagttg 
1551 gtcactgggc agaaacctta tcgctagatg tcaactctat ccactggtcc 
1501 cggaactcag tccccacttc ccagtgcagc gacccctgtg cccccaatga 
1551 aatgaagaat atgcaaccag gggatgtctg ctgctggatt tgcatcccct 
1601 gtgaacccta cgaatacctg gctgatgagt ttacctgtat ggattgtggg 
1651 tctggacagt ggcccactgc agacctaact ggatgctatg accttcctga 
1701 ggactacatc aggtgggaag acgcctgggc cattggccca gtcaccattg 
1751 cctgtctggg ttttatgtgt acatgcatgg ttgtaactgt ttttatcaag 
1801 cacaacaaca cacccttggt caaagcatcg ggccgagaac tctgctacat 
1851 cttattgttt ggggttggcc tgtcatactg catgacattc ttcttcattg 
1901 ccaagccatc accagtcatc tgtgcattgc gccgactcgg gctggggagt 
1951 tccttcgcta tctgttactc agccctgctg accaagacaa actgcattgc 
2001 ccgcatcttc gatggggtca agaatggcgc tcagaggcca aaattcatca 
2051 gccccagttc tcaggttttc atctgcctgg gtctgatcct ggggcaaatt 
2101 gtgatggtgt ctgtgtggct catcctggag gccccaggca ccaggaggta 
2151 tacccttgca gagaagcggg aaacagtcat cctaaaatgc aatgtcaaag 
2201 attccagcat gttgatctct cttacctacg atgtgatcct ggtgatctta 
2251 tgcactgtgt acgccttcaa aacgcggaag tgcccagaaa atttcaacga 
2301 agctaagttc ataggtttta ccatgtacac cacgtgcatc atctggttgg 
2351 ccttcctccc tatattttat gtgacatcaa gtgactacag agtgcagacg 
2501 acaaccatgt gcatctctgt cagcctgagt ggctttgtgg tcttgggctg 
2551 tttgtttgca cccaaggttc acatcatcct gtttcaaccc cagaagaatg 
2501 ttgtcacaca cagactgcac ctcaacaggt tcagtgtcag tggaactggg 
2551 accacatact ctcagtcctc tgcaagcacg tatgtgccaa cggtgtgcaa 
2601 tgggcgggaa gtcctcgact ccaccacctc atctctgtga 
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Seq ID 48: L697S, V698G, T742A^ D744N 

1 atgaagatgt tgacaagact gcaagttctt accttagctt tgttttcaaa 
51 gggattttta ctctctttag gggaccataa ctttctaagg agagagatta 
101 aaatagaagg tgaccttgtt ttagggggcc tgtttcctat taacgaaaaa 
151 ggcactggaa ctgaagaatg tgggcgaatc aatgaagacc gagggattca 
201 acgcctggaa gccatgttgt ttgctattga tgaaatcaac aaagatgatt 
251 acttgctacc aggagtgaag ttgggtgttc acattttgga tacatgttca 
301 agggatacct atgcattgga gcaatcactg gagtttgtca gggcatcttt 
351 gacaaaagtg gatgaagctg agtatatgtg tcctgatgga tcctatgcca 
501 ttcaagaaaa catcccactt ctcattgcag gggtcattgg tggctcttat 
551 agcagtgttt ccatacaggt ggcaaacctg ctgcggctct tccagatccc 
501 tcagatcagc tacgcatcca ccagcgccaa actcagtgat aagtcgcgct 
551 atgattactt tgccaggacc gtgccccccg acttctacca ggccaaagcc 
601 atggctgaga tcttgcgctt cttcaactgg acctacgtgt ccacagtagc 
651 ctccgagggt gattacgggg agacagggat cgaggccttc gagcaggaag 
7 01 cccgcctgcg caacatctgc atcgctacgg cggagaaggt gggccgctcc 
751 aacatccgca agtcctacga cagcgtgatc cgagaactgt tgcagaagcc 
801 caacgcgcgc gtcgtggtcc tcttcatgcg cagcgacgac tcgcgggagc 
851 tcattgcagc cgccagccgc gccaatgcct ccttcacctg ggtggccagc 
901 gacggttggg gcgcgcagga gagcatcatc aagggcagcg agcatgtggc 
951 ctacggcgac atcaccctgg agctggcctc ccagcctgtc cgccagttcg 
1001 gccgctactt ccagagcctc aacccctaca acaaccaccg caacccctgg 
1051 ttccgggact tctgggagca aaagtttcag tgcagcctcc agaacaaacg 
1101 caaccacagg cgcgtctgcg aaaagcacct ggccatcgac agcagcaact 
1151 acgagcaaga gtccaagatc atgtttgtgg tgaacgcggt gtatgccatg 
1201 gcccacgctt tgcacaaaat gcagcgcacc ctctgtccca acactaccaa 
1251 gctttgtgat gctatgaaga tcctggatgg gaagaagttg tacaaggatt 
1301 acttgctgaa aatcaacttc acggctccat tcaacccaaa taaagatgca 
1351 gatagcatag tcaagtttga cacttttgga gatggaatgg ggcgatacaa 
1501 cgtgttcaat ttccaaaatg taggtgggaa gtattcctac ttgaaagttg 
1551 gtcactgggc agaaacctta tcgctagatg tcaactctat ccactggtcc 
1501 cggaactcag tccccacttc ccagtgcagc gacccctgtg cccccaatga 
1551 aatgaagaat atgcaaccag gggatgtctg ctgctggatt tgcatcccct 
1601 gtgaacccta cgaatacctg gctgatgagt ttacctgtat ggattgtggg 
1651 tctggacagt ggcccactgc agacctaact ggatgctatg accttcctga 
1701 ggactacatc aggtgggaag acgcctgggc cattggccca gtcaccattg 
1751 cctgtctggg ttttatgtgt acatgcatgg ttgtaactgt ttttatcaag 
1801 cacaacaaca cacccttggt caaagcatcg ggccgagaac tctgctacat 
1851 cttattgttt ggggttggcc tgtcatactg catgacattc ttcttcattg 
1901 ccaagccatc accagtcatc tgtgcattgc gccgactcgg gctggggagt 
1951 tccttcgcta tctgttactc agccctgctg accaagacaa actgcattgc 
2001 ccgcatcttc gatggggtca agaatggcgc tcagaggcca aaattcatca 
2051 gccccagttc tcaggttttc atctgcctgg gtctgatctc ggggcaaatt 
2101 gtgatggtgt ctgtgtggct catcctggag gccccaggca ccaggaggta 
2151 tacccttgca gagaagcggg aaacagtcat cctaaaatgc aatgtcaaag 
2201 attccagcat gttgatctct cttgcctaca atgtgatcct ggtgatctta 
2251 tgcactgtgt acgccttcaa aacgcggaag tgcccagaaa atttcaacga 
2301 agctaagttc ataggtttta ccatgtacac cacgtgcatc atctggttgg 
2351 ccttcctccc tatattttat gtgacatcaa gtgactacag agtgcagacg 
2501 acaaccatgt gcatctctgt cagcctgagt ggctttgtgg tcttgggctg 
2551 tttgtttgca cccaaggttc acatcatcct gtttcaaccc cagaagaatg 
2501 ttgtcacaca cagactgcac ctcaacaggt tcagtgtcag tggaactggg 
2551 accacatact ctcagtcctc tgcaagcacg tatgtgccaa cggtgtgcaa 
2601 tgggcgggaa gtcctcgact ccaccacctc atctctgtga 



